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(57) Abstract 

The present invention provides for a method of updating 
an internet search engine database with the results of a user's 
selection of specific web page listings from the general web page 
listing provided to the user as a result of his initial keyword 
search entry. By updating the database widi the .selections of 
many different users, the database can be updated to prioritize 
those web listings that have been selected the most wltti respect 
to a given keyword, and thereby presenting first the most popular 
web page listings in a subsequent search using the same keyword 
search entry. 
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IMPROVED SEARCH ENGINE 



PCT/US99/05588 



Field of the hvention 

The present invention relates to a method and apparatus that allows for enhanced database 
searching, and more particularly, for use as an internet search engine. 

5 Background of flie Related Art 

An efBcient and practical means of obtaining relevant information and also screening 
unwanted/uninteresting information has been an ongoing need, especially since tiie incq)tion of &e 
internet. This need is particularly acute at present due to the exponential growth in &e number of world- 
wide web sites and the sheer volume of information contained therein. In an attempt to index the 

10 information available on the internet, a numb^ of software search engines have been created via which a 
user enters a search command comprised of suitable keywords from a keyboard at his personal computer. 
The search command is transmitted to a server computer, that has a search engine associated with the 
server computer. The search engine receives the search command, and then using it scans for these key 
words through a database of web addresses and the text stored on the web sites. Thereafter, the results of 

15 the scan are transmitted from the server computer back to the user's computer and displayed on the 
screen of the user's computer. 

In order for the search engine to be aware of new web sites and to update its records of existing 
sites, cither the proprietors of tiie web sites notify the search engine themselves or the information may 
be obtained via a 'web crawler' to update the database at the server computer. A web crawler is an 
20 automated program which explores and records the contents of a web site and its links to other sites, 
thereby spreading between sites hi an attempt to index all the current sites. 

This database structure and method of searching it poses some significant difficulties. The nitemet 
growfli-rate has resulted in a substantial backlog in the scaiming of new sites, notwitiiistanding die fact 
fliat web sites are frequently deleted, re-addressed, updated and so forth, thus leaving the search engine 

25 with outdated and/or misleading information. Although the web crawlers can be configured to prioritize 
possible key-words according to their location (title, embedded link, address etc), nevertheless, 
depending on the type of search engine used, substantial portions of the web site text (often involving the 
majority or even all of the site text) is still required to be scanned. This results in colossal storage 
requirements for the search engine. Furtiiermore, a typical key word search may bring up an excessively 

30 large volume of material, the majority of which may be of little interest to the user. The user typically 
makes a selection from the list based on the brief descriptions of the site and explores the chosen sites 
until the desired information is located. 

These results are ha the form of a list, ranked according to criteria specific to the search engine. 
These criteria may range from the niimber of occurrences of the key-words anywhere within the searched 
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text, to methods giving a weighting to key-words used in particular positions (as previously mentioned). 
When multiple key-words have been used, sites are also ranked according to the number of different key- 
words applicable. A fundamental drawback of all these ranking systems is their objectivity - they are 
determined according to the pr o gra m med criteria of ttie search engine, and the emphasis placed on 
5 particular types of site design, rather ftan any measure of fiie actual users' opinions. Indeed this can lead 
to the absurd situation whereby in an attempt to ensure a &vorable rating by the most commonly used 
search engines, some designers deliberately configure their sites in the light of the previously mentioned 
criteria, to ttie detriment of the presentation, readability and content of the site. 

SUMMARY OF THE INVENTION 

10 

It is an object of the present invention to ameliorate the aforementioned disadvantages of 
conventional search engines by harnessing Ae cerebral power of the human operator 

It is a further object of the present invention to provide a novel search engine with enhanced 
efHciency, usability and effectiveness with a reduced system storage and/or computational requirements 
15 in comparison to existing software engines. 

It is a further object of the present invention to provide a variety of indications of the populari^ of 
the search data, together with an indication of its date of creation or updating. 

In order to obtain the above recited advantages of the present invention, among others, one 
embodiment of the present invention provides for a method of updating an internet search engine 
20 database with the results of a user's selection of specific web page listmgs from the general web page 
listing provided to the user as a result of his initial keyword search entry. By updating the database with 
the selections of many different users, the database can be updated to prioritize those web listings that 
• have been selected the most with respect to a given keyword, and thereby presenting first the most 
popular web page listings in a subsequent search using the same keyword search entry. 

25 In another embodiment of the present invention, a method of determining content to provide along 

with listings transmitted firom a server computer to user sites is provided. In this embodiment, there is 
obtained a content listing fiom each one of a plurality of different developer sites. Each of the content 
listings includes content, a developer identifier, and a keyword, and a keyword selection factor. 
ThCTeafter, there is determined a particular keyword from tiie obtained keywords that is the same for 

30 different content listings. For that particular keyword, the keyword selection factor is used in 
determining when to transmit different content listings to the user sites. 

In still another embodiment, there is provided a method of updatmg a keyword table with the results 
of a user's selection of specific keywords which were obtained from a list of related keywords presented 
to the user. By updating the database with selections of many different users associated with that same 
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keyword, appropriate keywords can be provided and presented first when that same keyword is 
subsequently entered. 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other advantages of the present invention may be appreciated fiom studying the following 
detailed description of the preferred embodiment together widi the drawings in which: 
Fig. 1 illustrates certain of the overall features of the present invention; 

Fig. 2 illustrates various inputs to the search, and, for each of the different capabilities, illustrates tiie 
outputs that will be provided engine according to the present invention; 

Figs. 3 A and 3B illustrates an overview of the process by which web pages are selected in making 
up the search results provided to the end user according to the present invention; 

Fig. 4 illustrates the data sets used for different web-page searches according to the present 
invention. 

Fig. S shows the various data sets previously described, and various inputs and actions that result m 
a list of suggested web pages being provided according to the present invention; 

Fig. 6 illustrates the implementation of a popular search according to the present invention: 

Fig. 7 illustrates the implementation of a hot off the press search according to the present invention: 

Fig. 8 illustrates the implementation of a high-flyers search according to the present invention: 

Fig. 9 illustrates the implementation of a random search accordmg to the present invention: 

20 Fig 10 illustrates the implementation of a previous past favorites search according to the present 

invention. 

Fig. 1 1 illustrates the implementation of a collective search according to the present invoition. 

Fig 12 illustrates the implementation of a date created search according to the present invention. 

Fig 13 illustrates the implementation of a customized search according to the present invention. 

25 Fig 14 illustrates the implementation searching based upon a group identity according to the 

present invention. 

Fig. 15 illustrates a keyword eliminator feature according to the present invention. 

Fig 16 illustrates the process of determining which search results should be used to make up the 
cumulative surfer trace table according to the present invention. 

30 Fig 17 illustrates active suggestion of web pages according to the present invention. 
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Fig 18 illustrates passive suggestion of web pages according to the present invention. 

Fig. 19 provides an overview of suggesting keywords according to the presoit invention. 

Fig. 20 illustrates the manner of creating data sets for suggested keywords according to tiie piesoit 
invention. 

5 Fig. 21 illustrates a variety of manners in which a list of suggested keywords can be created 

according to the present invention. 

Fig. 22 illustrates how content is attached to web page listings according to the present invention. 

Fig. 23 illustrates various content data sets and operations that populate them according to the 
present invention. 

10 Fig. 24 illustrates various content data sets and operations that are used to select data from them a 

according to fee present mvention. 

Fig. 25 illustrates web page listings and ofeer content data according to the present invention 

DETAILED DESCRIFTION OF THE PREFERRED EMBODIMENT 

15 Figs. lA and IB illustrate certain of the overall features of fee present mvention, which will be 

described in furfeer detail hereinafter. It is initially noted that like- numbered reference numerals in 
various Figures and descriptions will be used in fee following descriptions to refer to fee same or similar 
structures, actions or process steps. 

The present invention is preferably implemented in a network environment wherein each 
20" "'C'oixiputef^ontwJ;"^icall^^ memory, nMd~mcKienirana~cor&in"ortfe " " " 

contain displays and fee like, as are well known. As shown in Fig. IB, a plurality of user 
sites/computers lOOA-lOOD are shown, as are a plurality of server computers 102A-B, and developer 
sites/computers 104A-B. It is understood feat in a typical internet network, that different server 
computers 102 can be interconnected togefeer, as is illustrated. Fuzfeer, while only a few user sites, 
25 developer sites and server computers are shown, it is understood that thousands of such computers are 
interconnected togefeer. 

While fee specific embodiments of fee present invention are written for applications in which fee 
invention is implemented as sequences of coded program instructions operated upon by a server 
computer 102 as illustrated, it will be understood that certain sequences of feese program instructions 
30 could instead be implemented in ofeer foims, sudi as processors having specific instructions specifically 
tailored for fee applications described hereinafter. 
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As will be illustrated hereinafter, additional operations, transparent to the user, are inq}lemented in 
order to obtain search results in the future based upon currently made searches. As shown, die present 
invention has various capabilities, each of which are illustrated in a parallel flow in Fig. lA, which 
illustrates an overview of fhe different capabilities that can be ongoing simultaneously. In teems of 
5 overall capabilities, start block 10 show three: suggesting web pages 12, suggesting keywords 14, and 
content suggestion 16. 

In order for web pages 12 to be selected by a user according to the present invention, there is a step 
18 in which the type of search to be performed is selected. Thereafter, in step 20, search input obtained 
from one of a variety of sources is input and used along witti the algorithm selected in step 18 to 
10 determine search results. The results of this search are then displayed to the user, as shown by steps of 
displaying a created list of web pages, displaying passively suggested web pages, and displaying actively 
suggested web pages, identified as steps 22, 24. and 26, respectively, in Figure 1. This capability, and 
how it is implemented, will be described in more detail hereinafter. 

In order for keyword suggestion to take place, which the user may or may not select, there is 
15 preferably an initial step 28 in which the type of keyword search algorithm to use is selected. Although 
many systems may have only one such algorithm, various ones, as described hereinafter, are possible. 
Once flie keyword search algorithm is selected, step 30 follows in which, based upon a ikeyword entered 
by a user, the current set of keyword data is operated upon to determine associated keywords. The results 
of fliis operation are then displayed to flie user in 30. This capability, and how it is implemented, will be 
20 described in more detail hereinafter. 

The previously mentioned web page and keyword selection capabilities inured to the direct benefit of 

the end user. -Anotfier-tiovel-feature-of the- present4nv«itionv wbich-lndirectly-inures to theH)aiefit of the- 

end user, directly benefits the advertiser, because it allows for content to be targeted in real time based 
upon various criteria. As will be described more fully hereinafter, a content providing algorithm is 
25 initially selected which wfll determine howcontwit is selected in step 34. Step 36 follows, and based 
upon xaputs from users and content providers, which content to show is determined. Thereafter, the 
advertisements are displayed for the user to see, simultaneously with the display of cither keywords 
and/or web pages 

While Fig 1 illustrates certain overall features according to the present invention, many of the 
30 advantageous features of the present invention are not, as mentioned previously, observable to tiie user, 
but instead transparent to user. They are. however, significant in order to fully explain how the present 
invention is implemented and are explained hereinafter. 

Fig. 2 is provided to illustrate various inputs to the search engine according to the present invention, 
and, for different capabilities, illustrates the outputs that will be provided More detailed explanations are 
35 provided hereinafter. Data that is potentially input from search engine user include: 
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• keyword 52 - this is the word or phrase that the user enter to find a list of web pages 

• profile types 54 - these are the groups of people they associate them selves with e.g. US, male, 
doctor etc. 

• user ID 56 - this is a unique identification for each user Aat chooses to register with the search 
5 engine. This can be done via a cookie or logon. 

• search type 58 - this can be actively chosen by the searcher to determine the type of search results 
they would like (popular, new, etc) 

• date-time 60 - this is passively recorded when a searcher uses the system 

• ff address 62 - this is passively recorded when a searcher uses the system 

10 • other 64 - this includes other personalization information such as search customization preferences, 
keywords for web page suggestion etc. This information is entered actively once by the user then 
used to personalize the search results each time the users (identified by user BD) uses the search 
engine. 

Data from web-page developers include: 
15 • URL 66 - this is the URL address of the web page or pages diat they wish to submit 

• description 68 - this is a 2-3 line description of the information on their web-page 

• keywords 70 - these are the keywords that the web page developer would like to associate their web- 
page with _ _ _ _ . _ _ 

• target audience 72 - these are the target audience (profile types 54) that the web page developer 
20 particularly want to target 

• date-time 74 - this is passively recorded when every a web-page developer submits a web page 
Data from content providers include: 

• bids 76 - these arc S bids for content as described later. 

• content details 78 -this includes all details of content providers including address, content details 
25 etc. 

Results from other search engines 80 — these are the results for a keyword search from other existing 
search engines. 
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Outputs of the search engine 10 are: 

lists of web pages 90 - depending on the input data a iist of web pages can be produced in web page 
determination step 82, described further hereinafter; 

content keywords 92 - the search engine suggests other keywords for users to tiy produced in key word 
5 determination step 84, described further hereinafter; and 

content 94 - the se^h engine sendis out selected content as produced in determine content step 86, 
described further hereinafter 

To facilitate ease of reference and aid understanding, the aforementioned and subsequently mentioned 
data-set definitions are reiterated and expanded upon below (and where appropriate, the structure of the 
10 dependant data-sets used to create the defined data-set are shown in tabular form) with reference to the 
preferred embodiment of the present invention. Thereafter, certain of these will be explained in even 
greater detail to fully teach how to make and use the present invention. 

Locations: a plurality of unique information entities. 

Web-pages: Locations in the form of Web-pages URL (Universal Reference Locator) addresses. 

1 5 Key-word: The word or phrase that is entered in the search engine 

Hit-list: The list of web-pages (URL addresses) that is the result of the key-word search. This hit-list 
ranks the relevance of the web-pages relative to the key-word. This hit-list always has a key-word 
associated with it. 



Input data set 


Output data set 


• Key-word (temporary) 

• Database to match the key-word widi 
(permanent) 


• Hit-list - Ranked hit-list of Web-pages 
(temporary) 



20 

Permanent data set. Retained long term (although it changes over time 
Temporary data set: Created only for the duration of the search 

Surfer trace: This is a measure of how users search. It is a trace of the key words they search for, the 
URLs subsequently selected and how long they spend diere, from which a rankmg of web-pages for a 
25 \3sm (surfers) can be calculated. It is a measure of which web-pages tticy found most useful aft^ the 
key-word search. The combination of all surfer traces is used to create a users* choice hit-list 
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Input data set 


Output data set 


• Key-word (temporary) 

• User selections from initial search results 
(temporary), i.e. Web pages visited 
(URLs) 

• Times spent a each URL 

• IP address of user 


• Surfer trace - A list of user web*pages 
users found useful for each key-word (can 
be permanent or temporaiy) 



Users' choice hit-list: This a semi-permanent ranking of web-pages associated with every key-word and 
indicates how useful Internet users found each of the web-pages associated with the key-word. The users' ' 
choice hit-list is incrementally updated by a new surfer trace. 



Input data set 


Output data set 


• Surfer trace (can be permanent or 
temporary) 

• Users' choice hit-list (permanent)* 


• New Users' choice hit-list - Ranked hit- 
list of **popular" Web-pages (permanent) 



• The initial users' choice hit-list will be the surfer trace. 

New web-page list: This is a list of new web-pages that is created by ULR submissions from web-page 
10 developers. When a web developer updates a web-page, they can submit the web-page address, brief 
information about the page and a list of key-words that the developer decides are relevant. The web-page 
is then placed on the top of each of the key-word new web-page lists. 



Input data set 


Output data set 


• All web-page developers infomiation 
about web address and key-words 


• New web-page list (pcnnanent) 



15 Content Provider's list: This is a list (associated with each key-word) of content providers which must 
typically that pay to illustrate content witii the key-word. The price paid is dependent on the number of 
other content providers, fhe amoimt they spend and the number of times the key word is searched for. 
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Input data set 


Output data set 


• Key-word 

• Content Provider's bids for content spots 


• Content Providers list - a list of content 
associated with each key-word 
(permanent) 



High-flyers hit-list: This a list of web-pages (associated with every key-word) that are increasing in 
popularity at tiie highest rate. It is an indication of how rapidly web-pages are rising up the users' choice 
hit-list and it is used as a means to ensure Aat new emerging web-pages rise to the top of the users' 
5 choice hit-list. 



Input data set 


Output data set 


• Old Users' choice hit-list - (temporary) 

• New Users' choice hit-list - (permanent) 


• High-flyers hit-list: A ranked list of web- 
pages that are rising in popularity the 
fastest 



Personal hit-list: This a list of web-pages the individual user has found most useful for each key^word 
search they have done in the past. It is like an automatic book-marking data set for each individual user. 



Input data set 


Output data set 


• Key-word 

• Individual surfer trace - (permanent) 


• Personal hit-list: A ranked list of web- 
pages that an individual has found useful 
in the past 



Collective Search hit-lists: This can be a combination of any of the above hit-lists, The arc manydifferent 
ways that these hit-lists can be combined. 



Input data set 


Output data set 


• Crawler hit-list (temporary) 

• Users' choice hit-list (permanent) 

• Advertisers' list (permanent) 

• New web-page list (permanent) 

• High-flyers list (permanent) 

• Personal hit-list (permanent) 


« Collective Search hit-lists - (De&ult) 
Ranked hit-list of Web-pages displayed to 
the user after the key-word search. It can 
be a combination of any of the hit-lists 
above (temporary) 
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Crawler key-word list: This is a list of key-word suggestions that the user may find useful. This is found 
by matching the key-word entered by the user to tiie database of key-words and phrases that otiier users 
have tried. This is the equivalent of flie crawler hit-list, though it is a ranking of key-words rather than 
Web-pages. The method for doing this uses a similar algorithm to a spell-checker only it does it for 
5 phrases. It also suggest Key-words, based on previous URL selections from sequences of user key-words. 



Input data set 


Output data set 


• Key-word (temporary) 

• Database of all key-words used 
(permanent) 


• Ranked hit-list of other key-words the 
user may want to try (temporary) 



Surfer key- word list: This is a data set comprised a list of key-words that the individual usct found useful 
after the key-word was selected. This is found by tracking which key-words the user decided to use. This 
10 is equivalent to the surfer trace. 



Input data set 


Output data set 


• Key- word (temporary) 

« Data about what key words were used 
from the key-word suggester 


• Ranked list of oAer key-words (associated 
with the key-word) that this individual 
user found useful (semi-pemianent) 



key-word suggester: This is a data set consisting of a permanent rankmg of other key-words that users 
have foimd useful, compiled from successive surfer key-word lists and is linked to each key-woid (this is 
equivalent of the users' choice hit-list). 



Input data set 


Output data set 


• Surfer key-word list (temp or permanent) 

• Existing users* choice hit list (permanent) 


• New users' choice key-word list 
(permanent) 



15 User Based Search Algorithm 



The discussion provided above provides the language necessary to more fully describe the 
present invention. As illustrated in Figs. 3 A and 3B, which provide an overview of the search engine 
capabilities according to the present invention in which web pages are selected in making up the search 
results provided to tiie end user. In step 112, the user enters up to 4 sets of data: keyword 52, profile 
20 type 54, search type 58 and User ID 56. The IP address 62 and date-time 60 are not entered by the user 
but can be read when a user uses the search engine. This data is used is used in parallel in steps 1 14 and 
1 16 to produce list of web pages. Step 1 14, discussed in detail hereinafter, is the process of selecting web 
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pages from novel new search engine data sets produced in accordance with the present invention. This 
can run, if desired, in parallel with step 116 which obtains a selection of web pages from other existing 
search engines. Thereafter, selection of web pages from step 114 and 116 are combined and tagged in 
step 118. The process of tagging the list of web pages, described in more detail below, enables a set of 
data, shown as surfer trace data in Fig. 3, to be created and sent back to the search engine vdien the 
search engine user selects a web-page from the list in step 120. The process of selecting a tagged web- 
page creates the following series of data which is used to update the search engine data sets; keyword 124 
, URL 126, user ID 128, IF address 130, date-time 132, brief web page description 134. 

Although it is preferred to use all of these different data types in the surfer trace data, use of 
different combmations of this data is fully within the intended scope of the present invention. The 
description 134 will typically only be included in the preferred embodiment of the invention when a new 
site is added to the data set 114 of the search engine 10, and the description used will be that description 
that appears on the original list of web pages. The date-time data 132 may only indicate that a site was 
selected, rather than record the period of time a user was at a particular site, as explained fizrtfier 
hereinafter. This process is invisible to the user who, upon selecting the web-page from the list of web 
pages is taken directly to the corresponding URL, step 122. Details of the implementation of steps 114, 
118 and 120 will be described in more detail hereinafter 

After the initial selection the user may choose to access another of the web-page URL search 
results. Depending on the relevance of tiie site, die user may spend time reading, downloading, exploring 
further pages, embedded links and so forth, or if the site appears irrelevant/uninteresting, the user may 
return directly back to the search results after a short period. Hie time difference between the two 
selections is recorded as the difference between two date/time data 132 from subsequent selections from 
the list of web page searches (in this embodiment, one can only measure the time spent at one v/eb page 
if another selection is made after visiting that web page - this then provides another surfer trace 132 
which allow a time difference to be calculated). This surfer trace data on the popularity of web pag« is 
used to rank the subsequent searches, as described further hereinafter. 

Thus, according to the present invention, it is the human users' powers of reasoning and analysis 
that is being used to establish the relevance of the different resiilts to the subject matter of the search. 
The present invention utilizes the cumulative processing and reasoning of all the himian users' to provide 
a vastly more effective means of obtaining the required information sources than is presently possible 
with the type of method described above. 

As described above, human brain power is captured by recording which web pages Ae user goes 
to after each keyword search. According to the present invention, collecting the surfer trace data is 
achieved by sending, in the list of web pages generated by the search to the user, hidden links that will 
automatically send information back to the search engine (or a subsidiary server). While the user only 
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sees that his intended link is displayed, the hidden link notifies the search engine of the transfer, which 
process can be executed with a Java applet Thus, when the Internet user selects a web-page it takes the 
user to that address but also sends off a the surfer trace data to the search engine 10, which notes what 
has been selected. When the user returns to the list of web pages and selects another web page listing, 
5 another Java applet is then executed which creates another surfer trace. The difference between the data 
time data in this surfer trace from two sequential selections cq}tures the time period that the user has 
been at the previous web site. This occurs wiAout the user knowing this data is being sent 

In another embodiment, rather tiian using multiple Java applets to collect a complete list of surfer 
trace data, there is no description data 134, and the date-time data 132 indicates that a user visited a 

10 particular web site. In one specific embodiment, the user must visit a particular web site for greater tiian 
a predetennined period of time, such as one minute or fifteen minutes, depending on what is an 
appropriate time to have looked at the site for the visit to the site to count and for any surfer trace data to 
be sent back to the search engine 10, as well be described hereinafter. In this embodiment, each applet 
contains all of the information necessary to update the database at the search engine. Another 

15 embodiment collects the surfer trace data prior to a user navigating to the intended web site. Other ways 
of obtaining this surfer trace data are possible and are witiiin the intended scope of the present invention. 

Thus, the search results page according to the present mvention is therefore differently formatted 
from conventional search engines' results pages. The difference is in action rather than content Visually, 
20 the page looks the same to the user as standard search results from other search engines. 

An example illustrates this point: In a conventional search, the results page for a search of the keyword 
"Weaflier" may readrl. www.weatiier.cora Today's weather forecast. Today is expected to be fine and 
sunny everywhere. 

The HTIP link associated with the "www.weather.com" label is "http-7/www,weather.com". This means 
25 that if the user selects this link, they will navigate to this page directly 

In contrast, according to the present invention, the tagged result page for the search made suing 
the keyword "Weather" may read 

1 . www.weather.com Today's weather forecast Today is expected to be fine and sunny everywhere. 

The HTTP link associated with fee "www.weather.com" label is "link.asp?n= 1 ." If the user selects 
30 fliis link, therefore, in a process is invisible to the user, the user is first directed to the linLasp page on the 
site corresponding to the web server using the search engine 10 according to the present invention, and 
passparameter n with value 1. 

Server side code (application code that runs on the web server) uses this parameter to identify flie 
URL and description of tiie user's chosen site. This information is flien stored in a database Table along 
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with other surfer trace data. The server side code then executes a redirect operation to the user's required 
URL. The user then sees dieir required page appear. 

The source of search results is independent to this activity. The destination page of the user is 
independent of this activity. The process is one of recording a user, keyword and destination into a 
database. This method of tracldng can only record the initial web-page visited after a keyword search. If 
the user continues to return to the search results list then subsequent web-page visits can be recorded. 

The surfer trace data ttiat is sent back to the data sets 114 of the search engine 10 as a result of the 
user selecting the web-page can be encrypted to prevent fraudulent users torn sending fake data to the 
search engine. 

Another method of tracking where a user may connect to from an initial URL selection (if they do 
not return to the search result page) is to run the selected web-pages as part of a 'frame' located at the 
search engine web-site. This permits a complete record of the web pages visited to be recorded after a 
keyword is entered. However, this imposes an additional level of complexity to the system with a 
possible decrease in system response time. 

As previously mentioned, the surfer trace data that can be collected includes keyword 124 , URL 
126, user ID 128, IP address 130, date-time 132, brief web page description 134, and is identified as such 
since it provides a trace or record of how searchers (surfers) use the search engine. This data is used to 
improve ftiture searches building on the preferences of previous searchers. The surfer trace is thus a 
measure of the preferred choices of an individual user or web 'surfers' from the initial search results for a 
particular set of key-words.How the data sets are created that determine the list of web pages 

Fig. 4 illustrates the data sets used for different web-page searches according to the present 
hivention. The data sets (tables) that are used to determine the list if web pages include keyword table 
164, profile ID table 166, security table 168, cumidative surfer trace table 170, keyword URL link table 
172, personal link table 174, and web-page (URL) table 188. 

The structure of the aforementioned data sets are described in more detail hereinafter. The 
descriptions tliat follow show the data arranged in a spreadsheet fashion, witii multiple values per cell 
and many blank cells. Illustration in this manner is convenient for explaining the present invention, but is 
not an e£5cient storage and retrieval method. As will be apparent to those skilled in die art, a relational 
database model would be used to implement the data storage according to the present invention such that 
there may be multiple fields or Tables involved to store the data and each field will store only one value. 

Keyword Table (164) 

The contents of keyword data table 164 of Fig. 4 are shown in more detail in Table I shown below, and 
is a list of keywords, including phrases, and the number of times they have been requested. If the list 
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becomes unmanageably large, the key-words that are not used again after a predetermined time period 
could be deleted from the list. However is would be desirable to keep the majority or all keyword phrases 
that are entered, if possible. 



Key-word 


Cumulative number of times the key-word is 

re(|uested (W) 


Unique number for each key-word 


Key-word 1 


Wl,W2,W3etc 




Key-word 2 






Key-word 3 






Key-word 4 






Key-word 5 






Key-word 6 






Key-word 7 







5 Table 1 List of informatioii requests and the number of times it is requests 



The cumulative number of times a keyword is requested may be segregated according to the different 
'*users profiles" selected (Wl, W2, W3, ....)• e.g. Wl = total searches. W2= male profile, W3 = Female 
profile , W4 - USA profile and so forth. It should be noted that the sum of W's will be greater &at the 
total number of times a site has been visited because the user may fall into more than one profile category 
10 e.g. a male-(W2) from the USA (W3). This would become a list of not only the number of user searchers 
using that key-word but also a list of the type of user (according to the profile type selected) searching 
for that keyword. Keywords that mean the same thing in different languages are different keywords, as 
long as the spelling is different, although they could be related using the keyword suggester, as described 
hereinafter. 

15 Web-page Table (188) 

The contents of web-page table 188 of Fig. 4 are shown in more detail in Table 2 shown below, and 
contains a list of Internet web-pages. Each web-page has a URL address, an associated 2-3 line 
description, a unique web page number for each URL(which can also be any character, symbol code or 
representation) and the cumulative number of times the URL has been visited. The URL address will 
20 have a unique number (which can also be any character, symbol code or representation) assigned to it 
rather than storing the full URL string in the subsequent data-Tables. 
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line ucscnpuun 


uiiic]uc nuuiDer lor eacn 
URL address 


frequency uie URL 
(web page) is visited 


URL address 1 








URL address 2 








URL address 3 








URL address 4 








URL address 5 








URL address 6 








URL address 7.., 









Table 2 List of information suppliers and a description of tlie web-page 



Keyword URL link Table (172) 

The contents of keyword URL link table 172 of Fig. 4 are shown in more detail in Table 3 shown below. 
This table is of particular significance with respect to the present invention because it contains 
5 information about the links between information supplies (URL addresses or web pages) and information 
requests (keywords). 

This data is recorded in further data sets which describes the relationship between the key-words and 
occurrences as defined by the following three parameters. 

- the cumulative number of significant visits (hits) to each URL addresses corresponding to each key- 
10 word (herein referred to as X or weighting* factor X). This is a measure of the popularity of tiie URL 

for each keyword and is determine from the surfer traces. 

- the previous cumulative number of significant visits measured at an eariier predetermined mstant; 
(herem referred to as Y or weighting &ctor Y) 

- a date time factor relating to die instant of the creation or input of each said web-page(herein referred 
15 to as Z or weighting &ctor Z). Z is the data time in which a web-page developer submitted a web-page 

to tiie search engine. 

Not all combinations of key-words and URL addresses will have data for X, Y and Z. 
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Key-word 


Key-word 


Key-word 


Key.word 


Key-word 


URL address 1 


x,y;z 










URL address 2 












URL address 3 






X,Y,Z 






URL address 4 












URL address S 












URL address 6 












URL address 7 













Table 3 Links between information suppliers (web-pages) and Information requests (key-words) 



Profile type s with the keyword URL link Table 



The popularity of web pages will be difFeient for different groups of people. The inclusion of 
multiple profile type s will produce multiple values of X Y and Z in Table 3, e.g. one may have a Global 
and New Zealand popularity rating denoted by XI X2 Yl Y2 etc. 





Keyword "sports" 


URL address relating to Rugby 


X1«520,X2=52 


URL address relating to Basketball 


XI =4000 X2 = 20 



In this example the global popularity {using die general profile type ) for the Rugby and Basketball URL 
addresses are 520 and 4000 respectively and 52 and 20 respectively for the New Zealand profile type . 

When the general profile type setting is used (ranked based on XI), the Basketball site would be 
ranked at the top. When the New Zealand setting is chosen (ranked based on X2) the rugby site would be 
higjiest This would be a reflection of the preferences of the New Zealanders. This is a very simple 
method of storing the preference of different groups of people. 

One would expect New Zealand-based rugby web-sites to rate higher than an overseas site on the 
New Zealand list, but there is no reason that ftis has to be the case. Someone in Spain may have the best 
Rugby site in the world The system evaluates web-pages only on the perceived quality of information 
by the users -the physical location of the site is immaterial. 
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There could be a vast range of X values representing different countries, occupations, sex , age and 
so forth, enabling, the popularity of different groups to be captured very simply. Users could choose to 
combine any of the X values according to their pmonal interests/characteristics. 

As an example, if say, 

5 • XI is for males 

• X2 is for females 

• X3 is for New Zealanders 

• X4 is for USA 

• XS is for engineers 
10 • X6isfor lawyers ... 

A "male" and a **New Zealander" would using tiie search engine increment both X3 and XI. This facility 
would increase the data requirement of the system but it could vastly improve the search results for 
different users. The total popularity of the web-page needs to be stored as a separate number as users may 
contribute to more than one of the groups of people. The sum of all of the individual popularity's would 
1 5 be greater than the total popularity because user can belong to more than one profile type. 

To simplify the system for the user there would be a default profile type (selection of X's) with an 
option is to use other profile type s to do specific searches. For example, a user may have a default profile 
type of a New Zealand male, but if a technical search is required a "global engineers" profile type may 
be chosen that refiects the cumulative search knowledge of engineers around the world. 

20 The extent of personalization could be dependent on the firequency of searching. For example, 

common keywords such as "news" would have a high degree of personalization (a large range of X 
values) and less common key-word such as "English stamps" would have little or no personalization 
(only a global X value). The degree of personalization could be a function of the fi-equency that the key- 
word is used (found torn Table I). 

25 Camuiative surfer trace Table (170) 

The contents of cumulative surfer trace table 170 of Fig. 4 are shown in more detail in Table 4 
shown below. Information about the links between web pages and keywords in Table 3 ( also referred to 
as keyword URL link table 172) is updated by the surfer trace data. The cumulative surfer trace is the 
combined information from all individual surfer traces and it is used to determine how many "hits'* 
30 (significant visits) each web-page had for each key-word. 

The information collected from each individual surfer trace is a series of inputs previously described, and 
shown below in Table form 
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IP Number 


User ID 


Keyword 


URL(webpage) 


Date-time 























Table 4 Each row is one surfer trace and the combined rows are the cumulative surfer trace 



The way the surfer trace data is processed to update Table 3 is described further hereiiiafter.Pn>file 
ID Table (166) 

The contents of profile ID table 166 of Fig. 4 are shown in more detail in Table 5 shown below. This 
S table includes a unique identification, password, contact email and a defaiilt profile type which they 
normally use to perform their searches. 



User identification 


password 


email 


Default 
profile 


Other 

information 


Joe Bloggs 


dogs 


jbloggs@AOL 


US, Male 

























Table 5 User tdentificatton Table 



The users default profile type is stored as the part of the user's personal preferences profile, which 
would accessed by entering some form of personal identification to the system. This information could 
10 be supplied when logging on to the data search engine or the search engine could leave a *'cookie**, as 
that term is known in the art, on the computer to identify a user, (there would be an optional e-mail 
address and password (or similar) associated with the logon procedure). The IP address itself would not 
be a sufficient means of identification as it is not necessarily unique to the individual users. 

The other information can include user defined preferences for how the search results are combined 
15 and keywords that are of particular interest to the user. This information can be used to actively 
customize die search results and suggestions of web pages to visit 

Personal link Table (174) 

The contents of personal link table 174 of Fig. 4 are shown in more detail in Table 6 shown below. 
Table 6 is identical in structure as Table 3, and can be used to record a users personal preferences relating 
20 to each URL mcluding the number of times visited and the key-words. In this Table 6, however, Z is not 
the date that the web-page developer submitted the web-page by it is the date-time that the user visited 
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the web page. This allow the users could refine a search by defining the last time they visited the web 
page. 





Key-word 


Key-word 


Key-word 


Key-word 


Key-word 


URL address 1 


x,y,z 










URL address 2 










x.y^ 


URLaddiessB 












URL address 4 


x,y,2 










URL address 5 




x,y,z 




x,y,z 




URL address 6 












URL address 7 













Table 6 Links between information suppliers (web-pages) and information requests (key-words) 
5 for an individual user 

The data in Table 6 is only accessed by the mdividual that created it, and accessible using a user ID 
that is preferably independent of changes in the user's e-mail or IP address changes and would thus 
enable their past personal preferences to be retained during such changes. 

This Table 6 data set could be stored either at the search engine site or on an individual's computer. 
10 Storing on local PC's would require additional software to be installed on the users computer. There are 
numerous advantages to storing the information at the search engine including the fact that users are 
likely to go there more often and unlikely to change search engines once they have a substantial book 
mark list 

Security Table (168) 

15 The contents of security table 168 of Fig. 4 are shown in more detail in Table 7 shown below. To 

ensure that users do not submit the same key-word over and over to increase its popularity the following 
security data table is used. Each entry is a single piece of information i.e. yes or no. This table can be 
created for links between keywords and IP addresses or links between keywords and User ID*s. 





Key-word 1 


Key-word 2 


Key-word 3 


Key-word 4 


IP address 1 


1 








IPaddress2 








1 
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IP address 3 










IP address 4 








1 


IP address S 




I 







Table 7 Security Table to ensure one computer user does not submit keywords to artificially boost 
the popularity of a web-page 

Described hereinafter are the processes that are used by die present invention to populate each of the Fig. 
4 tables mentioned previously. 

5 Populating the keyword Table 164 

This table is populated every time a user enters a keyword 52 to the search engine. A submitted 
keyword is compared to the keyword list in Table I (keyword table 164} and added if it is not already 
present. If it is present, the cumulative number is increased by one. If the user has a profile type then the 
cumulative number for the keyword for each type of profile will also be incremented (W1,W2 W3 etc). 

10 Populating the web-page data Table (URL Table) 188 

This table is populated in a number of ways, including: 

• user selecting a URL address 126 that is not already in Table 2 (URL table 188). The URL 
address 126 and description 134 are put directly into die web-page data table 188. The new 
URL is assigned a unique identification number. 

IS • in Step 176, as shown in Fig. 4, web-page developers can submit a URL 187 and description 

68 which also goes directly into the web-page data table 1 88, 

• web crawlers may also add URL addresses and descriptions (the description is either the first 
few lines of the web-page or in the HTML coded 'title"). This is not an essential element of the 
system but it could be a mediod to obtain URL*s and descriptions. With this search system web 

20 crawlers are more likely to be used to verify the information mther than find new information. 

Populating the cumulative surfer trace Table 170 

The cumulative surfisr trace table 170, also referred to above as Table 4, is populated each time a 
^tagged" web-page is selected by a user. This sends a packet of surfer trace mformation, such that die 
surfer trace data is added to the table each time die user selects anodier web page from a web page list. 

25 Populating the keyword URL Unk Table 172 

The data &om the ounulative surfer trace 170 Is used to update the popularity of web pages as recorded in Table 3 
(X,Y), also referred to as the keyword URL link table 172. The frequency of updating Table 3 with the data from 



wo 99/48028 



PCT/US99/05588 



-21- 

the cumulative surfer trace (170) to obtain new values of X and Y is a variable Aat can be changed, bom ranges 
that are shorter than cvexy hour to longer than cvciy monflt It should be noted that different keywonls con be 
updated at different intervals of time. 

An intennediaiy step in processing the cumulative surfer trace is to form a cumulative surfer hit 
5 table. This is subsequently used to modify the values of and X,Y in Table 3 

As mentioned above, the simplest method of recording a link ("useful visit" or "hit*0 between a 
keyword and a URL would be to count each keyword, URL paring in a surfer trace as a •'hit". A more 
meaningful and sophisticated method is only to count a location selection as a valid if the user meets 
certain criteria. This criterion could be the user exceeding a specified time at a location. If this criterion 
10 was not met, the selection would not be increase the cumulative value of X in Table 3. 

It is also possible to increment the value of X based on the time spent at the web page. The longer 
the time spent the more this increments the value of X. X does not have to be a whole number. 

Due to the variations in web-site capabilities in terms of log-on times, down loading times, 
bandwidth, and response times, the predetermined time used to denote a valid 'hit' may be suitably 
IS altered. Specialist web crawlers may be employed to independently validate such data. 

The selection of a content provider's banner after a keyword search counts as a hit for their web- 
page (increment the value of X). This will enable their web pages to possibly go up the popularity list 
associated with the keyword This acts as a mechanism to enable a web-page developer to pay to be seen 
with a keyword They can not pay to go up the popularity list - this will only occur if people visit their 
20 site and spend time there and record a valid hit for the popular list. The values of a content hit can vary 
(e.g. if could be 1 or 0.5 or 7) depending on the emphasis one wants to place how much that content 
affects the popularity ranking. 

This cumulative surfer trace information can be processed in a large number of ways to populate 
Table 8 (below). Groupmg the cumulative surfer trace according to the IP addresses or user ID produces 
25 the search pattem for an individual usas. This is a list of key-words and URLs and times. This allow the 
time spent at each web-page to be calculated for each user (it is not possible to calculated the time spent 
at flie last web pages of a search session as fliere is no time record after Aey go to that web page) 

If the time between each visit is longer than a certain time period, one is added to the cumulative 
surfer hit (a) table for the key-word URL. (this is the simplest method, methods in which relevancy is 
30 proportional to the time spent at the site, for example, are also properly within the scope of the present 
invention). 
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Key-word 


Key-word 


Key-word 


Key-word 


URL address 1 










URL address 2 


a 




a 




URL address 3 




a 




a 


URL address 4 




a 






URL address S 










URL address 6 






a 




URL address 7 








a 



Table 8 cumulative surfer hit table created from accumulated surfer traces 

The cumulative surfer hit is used to update the value X in Table 3 in the following way 
X(new) =(X<oid).HF)+a. 

HF is the history factor which is a number between 0 and 1. The history factor does not have to be the 
same for every key-word and could be varied depending on the rate at which die keyword is used. 



The data collected for Table 8 is used to recalculate the values of X in Table 3 after a predetermined 
time period. The frequency of updating Table 3 will influence the value of the History factor (HF) 
chosea The reason for multiplying the existing X by a 'liistory factor" is so that tfie perceived 
popularity does not last indefinitely. The history factor reduces the weighting attached to the past 
popularity. To illustrate by way of an ejcample, the key-word "sports news" may have an existing 
popularity with the following ranking (based on the number of hits per web-page, X) 

1 Winter Olympics web-page X= 1 9000 

2 Soccer results web-page X-1 8000 

3 Baseball results web-page X-ISOOO 

4 Golf news web-page X=15000 



The cumulative surfer hit Table for a week may be: 



1 . Winter 01yn^}ics web-page a=500 

2. Soccer results web-page a-1800 

3. Baseball results web-page a^lSOO 

4. Golf news web-page a=4600 
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The reason for the change in the number of hits reflects the fact that the winter Olympics has finished and 
the Master golf tournament has started. If one has a •'history factor" of 0.9 then the new popularity (X) 
will be: 

1 Golf news web-page 18100 (0.9 xlSOOOf 4600) 

2 Soccer results web-page 1 8000 (0.9 x 1 8000+1 800) 

3 Winter Olympics web-page 17600 (0.9 x 19000+500) 

4 Baseball results web-page 15000 (0.9 xl5000+1500) 

5 Thus, the more popular web-pages can emerge and the less popular decline, reflecting the fluctuation of 
interest over time in different subjects and events. 

The database is therefore utilizing the human mind to provide a powerful indication of what people 
find useful on the Internet. The users themselves replace a substantial computation requirement that 
would otherwise be required to filter through such searches. 

10 The value of Y in Table 3 is the old value of X, and the value of Y will be updated at intervals that 

are deemed appropriate, which interval could be minutes, hours, days, weeks or longer. The iqxlate 
interval does not need to be the same for all different keywords, as previously mentioned. This is used 
to calculate the rate of change of popularity of web pages and can be used as a selection criteria. 

Different profile type s in the web-page/USL link Table 

15 The cumulative surfer trace includes information on users profiles so Table 8 can be calculated with 

subscripted values of a for different profile types. These values of au aa as etc would correspond to the 
profile types for the subscripted values of X. This allows the popularity of different groups of people to 
be recorded. 

New web-page data input to the web-page/URL link Table 172 

20 The simplest method of having new pages recorded by the search engines is for web-page 

developers to submit information, shown as action 176 in Fig. 4, which information includes URL 66, 
key-words 70, site descriptions 68, target audience 72 and date-time 74, each time they create or update a 
web-page. 

This infonnation directly updates Tables 2 (URL tablel88 of Fig. 4) and 3 (Keyword URL link table 
25 172 of Fig. 4). The URL 66 and description 68 are entered in Table 2 and ttie date-time (74) at which fee 
page is submitted (die Z value) is inserted in Table 3 for each of the key-words (70). Users are allowed a 
set number of keywords 70 with which they can submit their web page. An example of what Table 3 
would look like witfi just Z values is given below (format dd-mm-yy). 
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Key-word 


Key-word 


Key-word 


Key-word 


Key-word 


Key-word 


URL address 


27/02/98 




27/02/98 








URL address 


28/02/98 


28/02/98 




28/02/98 






URL address 














URL address 






18/02/98 




18/02/98 


18/02/98 


URL address 














URL address 








28/02/98 






URL address 




29/02/98 











Table 9 Data Table created from submission by web developers 



If there is no date for the combination of the URL and keyword in Table 3, then the new date is 
automatically inserted. If a date already exists in the Table, then the dates are compared and if the dates 
are too close, i.e. less than a pre-determined period, then the old date remains and the new date is 
5 ignored. This stops people from constantly resubmitting to get on the top of the new^web page list by 
resubmitting their web pages. If the URL in Table 3 has other keywords wifli values of Z closer than the 
pre-determined period then the submission is also not allowed. This stops web-page developers from 
resubmitting their web pages with difTerent sets of keywords. 

When users submit a URL they could target it at specific types of users (different profile type s Zl , 
10 Z2, Z3 etc) as per Table 3. For example, an URL submission specifically targeted at New Zealanders 
(e.g, Zl) will appear at the top of keyword new list when New Zealanders search for that keyword. It 
will remain at the top until someone else submits a URL for that keyword targeted at New Zealanders. 
URL's that are targeted at other audiences will not appear as new sites for New Zealanders or 
alternatively they will not feature as high in tiie new list as the ones specifically targeted at New 
15 Zealanders. 

The data on new web pages does not necessarily have to be entered by web-page developers. It 
could be automated by having a web document template that automatically submits data to the search 
engine whenever the information on tiie web-page has been significantly changed. It would prompt the 
web-page developer to change any key-words as appropriate. 

20 Another embodiment requires sending specialist crawlers out to find web site addresses and key- 

words, though Hds has many of the drawbacks of existing web-crawlers. It could only be effective if web 
designers deliberately configured their page with the key-words identified. Any web site 
designer/proprietor willing to do this would also presumably be willing to submit any updates to the 
search engine to benefit from the instantaneous listing on the search results. 
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An extension of fliis principle is to auto-detect if a web address possessed key-word information in 
the database and then automatically send an invitation to provide the information to enable their web- 
page to be found easily. The ideal number of tey-words to be submitted with each web-page is 
preferably less than 50 and probably preferable within the range of about 5 and 20. This also 
5 advantageously forces web-site designers to find the most appropriate keywords to describe fteir site and 
also enable them to choose the audience they wish to target. 

The web-page submission process may also mclude web-page developer identification process that 
restricts the ability of people to use the system fiaudulently. This may include a payment to prevent 
multiple web-page submissions. 

1 0 Populating the profile ID Table 166 

ID table 166 of Fig. 4 is populated from the direct inputs from users. When users search the can 
choose their profile type 54 from a layered drop down menu, which could include, for example: 

• Gender (Male or Female) 

• Occupation (Professional, student etc) 

15 • Age category etc 

The user selects different profile types from tiie options they are prompted if they wish to save this as 
flieir default profile type. This is then recorded in Table 5 (profile ID's tablel66). The user may also 
select personalization options from a specific personalization options page rather than a drop down menu 
on &e search page. 

20 Populating the personal link Table 1 74 

The cumulative surfer trace is used to identify the search patterns of individual users based of 
sorting by User ID 126. This information is used to update the personal link table 174 in the same way 
that the cumulative surfer trace 170 is used to update Table 3 Qceyword URL link table 172). This table 
stores users past preferences as a.form of automatic book maridng. 

25 Populating the secority Table 168 

Each time a user enters a keyword 52 into the search engine it updates the security table 168 (Table 
7) by making a link between the keyword 52 and the P address 62 (or making a link between the 
keyword 52 and the User ID 56). The data in Table 7 is cleared periodically as the purpose is to stop 
systematic repeat searching from affecting the popularity lists (value of X in Table 3) rather than 
30 stopping individuals who occasionally perform the a repeat keyword search from affecting the popularity 
list 
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Determining tlie list of web pages 

Fig 5 shows the various data sets previously described, and various inputs and actions Aat lesult in a 
list of suggested web pages being provided, and will be described in more detail hereinafter. As shown in 
Fig. 5, user data entered into the search engine can include; keyword 52, user ID 56, search type 58, ff 
address 62 , profile types 54. How this data can be used to determine a list of web pages 250 as well and 
deciding which of the list of web pages to tag (step 118 of Fig. 3) for the purposes of creating a surfer 
trace is described hereinafter. 

The numbers PC Y and Z) in Table 3, which correspond to k^ord URL link table 172 in Fig. 5 
contain all the information required to give tiie following types of searches 58: 

• Popular-list search ranked hit-list of the most popular URLs for that keyword based on the number X 

• Hot off the press search ranked hit-list of newest URLs for the keyword based on the date/time (Z) 

• High-flyers search ranked hit-list of best emerging URLs based the dififeren^^ 

• Random search hit-list that is a random sample of URLs that have any of the numbers X, Y or Z 

• Date created search this is hit-list based on the date time Z and the user-specified date of interest (not 
just the newest) 

The personal links table 174 also allows past preferences to be listed as search results 

• Previous favorites search is a ranked hit-list base on the previous popularity for the individual (X from 
Table 6). This search is based only on the previous searching of the individual user. This allows the 
users to very quickly find site &at they have previously visited. 

A number of other search options are also available. 

• Conventional search is the list of search results from a normal search engine ( 1 1 6 Fig 3} 

• Othercontentonly search. This is a list of othCT content, such as advertisements, associated with the 
key-word. 

These search results can be combined in a number of different ways 

• Collective search ranked hit-list that is a collection of any of the search hit-lists described above (this 
is the default set of search results) 

• Customized search ranked hit-list that can be a user defined combination of any of the above lists. 

Fig. 5 also illustrates the use of keyword table 164 and security table 168 in a decision 246 to send 
out tagged web pages. This decision is based upon the frequency of key word usage, the data in the 
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security table and the presence of a user identification. The details of the decision to send out tagged web 
pages is described fiiUy in figure 16. 

How the different types of search lists are implemented 

More details on how each of these types of searches is implemented is provided below along with 
5 some of the advantage and disadvantages of each. The system relies on the brain power of fte user, this 
time to determine what sort of search they want to do which will depend on what they want to find The 
search methods are described easily so users should intuitively know which one to use. 

Popular search. 

Fig 6 illustrates the process for determining a list of popular web pages associated with the entry of 
10 a keyword 270 in step 272. If this search is selected and a keyword is entered, step 274 follows and 
produces a list of web pages based on the values of X taken from Table 3 (172, Fig 5) for the keyword 
270 entered. These web pages are identified by a unique web-page(LIRL) number ftom Table 3. 
Thereafter, in step 276 tiie list of web-page numbers found from step 274 is combined with the URL 
address and web-page description from Table 2 (188 Fig 5). In step 278 the resulting list of web pages is 
15 then tagged, depending on the results of step 246 in Fig 5 as described previously, and sent to the usct for 
them to make their selections.Hot off the press search. 

Fig 7 illustrates the process for determining a list of new web pages associated with the keyword 
entered in step 290. If this search is selected and a keyword is entered, step 294 follows and produces a 
list of web pages based on the values of Z taken from Table 3 (keyword URL link table 172 of Fig. 5) for 
20 the keyword entered in step 290. These web pages are identified by a unique web-page (URL) number 
from Table 3. Thereafter, in step 296 the list of web-page numbers found from step 294 is combined with 
the URL address and web-page description from Table 2 (URL table 188 of Fig. 5). In step 298 the 
resulting list of web pages is then tagged depending on tiie results of step 246 m Fig 5 as described 
previously, and sent to the user for diem to make their selections. 

25 The user will also be able to see cxacfly ^en each web-page was submitted so Internet users can be 

aware of its currency. An indirect consequence of this feature is the incentive for web designers to 
iqjdate their sites. The prominence given to new and updated sites provides a means of becoming 
established on the popular hit-list and encourages the use of appropriate key-words and rewards the up 
keeping of web pages that users find useful. 

30 High-flyers search. 

Fig 8 illustrates a high-flying web pages search associated with the keyword entered m step 320. 
This is a list of web pages that are increasing in popularity fastest. If this search is selected and a 
keyword is entered, step 324 follows and produces a list of web pages based on the relationship between 
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the values X and Y taken from Table 3 (172, Fig 5) for the keyword 320 entered. These web pages arc 
identified by a unique web-page (URL) number from Table 3. Thereafter, in step 326 the list of web-page 
numbers found from step 324 is combined with the URL address and web-page description from Table 2 
(1 88 Fig 5). In step 328 the resulting list of web pages is then tagged depending on the results of step 246 
S in Fig S and sent to the user for them to make their selection. 

The high-fiyer list is calculated by comparing the old popular ranking (Y) and the new popular 
ranking (X) from Table 3. From this the percentage increase in hits is calculated. An alternative method 
would be to rank the rate of change of popularity by the number of places they rose compared to last 
time. 

10 The formula of calculating the rate of change of popularity for this embodiment is given by: 
((X.YyY).pC/(XJJ)) 

where is the maximum value of X for the coiresponding key-words and fl is an additional variable 
fliat can be changed to alter the relative significance of changes at the top and bottom of the popularity 
list 

15 The reason for multiplying by the maxhnum value of X is to ensure that small changes at the lower 

popularity levels do not swamp more significant changes higher up the table. For example, a web site 
having previously recorded only one selection and then attracting 5 hits the next day would exhibit 
percentage increase of 500% whilst another web-page may have experienced an increase from 520 hits to 
4000 hits (a much more significant increase) though this would otherwise appear as a lower percentage 

20 . increase. 

Random search. 

This is a random selection of less-popular web-pages for the user that want to Jook at web-pages off 
. the beaten track, based upon a random selection of web pages that has any value of X, Y, and Z 
associated with a keyword that is entered. Accordingly, after a user enters a keyword in step 352 as 

25 indicated in Fig. 9, reference is made to the keyword URL link table 172 illustrated in Fig 5, and a 
random , list of web pages numbers are generated automatically using a random number generator are 
determined, as illustrated at step 354. Only web pages that have values for X, Y or Z associated with the 
key word are chosen m this random selection as this indicates that at some stage in the past as used or 
web page developer thought the web page had some connection to the keyword. Thereafter, in step 356 

30 the list of web-page numbers found from step 354 is combined with the URL address and web-page 
description from Table 2 (188 Fig 5). In step 358 the resulting list of web pages is then tagged, depending 
on the results of step 246 in Fig 5 as described previously, and sent to the user for them to make their 
selections. 
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Conventional search. 

This is the nonnal search method of a conventional search engine, referenced as other search engine 
116 in Fig 3, which niay or may not be included along with the searches according to the present 
invention, at the option of the user, as noted previously. 

5 Content only search. 

This is a list of content, such as advertisements, associated with the key-word, which the user cannot 
control. The ones that have paid flie most will be at the top of the list, as described further hereinafter, in 
accordance widi the preferred embodiment of the invention. Of course, othor systems for identifying the 
order of paying content providers can also me implemented. 

10 Previous favorites search. 

Fig. 10 illustrates a previous past favorites search, that is based only on the previous searching of 
the individual user. This allows the users to very quickly fmd sites that they have previously visited and 
performs, therefore, automatic book marking. It should be noted that since a password is preferably used 
to logon to the search engine system according to the present invention, the user will be able to access 
1 5 their personal preferences from any computer. 

Thus, when the user types in a keyword at step 372 as indicated in Fig. 10, step 374 follows during 
which it is determined what are the favorite sites (based on previous usage) for &at keyword from the 
personal link table 1 74 illustrated in Fig. 5. Because the user has a password that can be used to logon to 
the system the user will thus be able to access their personal preferences form from any computer. 

20 Due to this search capability there is, therefore, no need to manually bookmark web pages. If a user 

forgot to book-mark a good site on, for example, 'maibles*, they can easily fmd it by i^typing the 
keyword that lead them to tfiat site. If a user's preferences change they will be reflected in the personal 
links table 174. 

Anoflier embodiment of the personal preference search includes specifying the date the web page 
25 was last visited, with or without using a keyword. Hie web pages arc then ranked based on Z in personal 
links table 174 of Fig. 5. For example if a user looked at a site in die middle of last year the user can 
refme the search by date, thus making it easier to find a previously useful web-pages more easily, even if 
they could not remember the relevant keyword. 

This automatic book-marking feature can also act as a device for monitoring the type of Internet use 
30 being undertaken by a particular computer and thus for example, can provide wammg to 
parents/employers of children/employees accessing undesirable sites, such as adult web-pages. In a 
preferred embodiment, for parents/employers unlikely to use the computer themselves, notification of 
such usage is automatically provided by letter to the parent/employer thatlists the keywords selected and 
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web pages visited by the children/employees. This information is found directly from each user table 
174 of Fig. S.This requires a user identification code tiiat also included parratal/employee infonnation. 

Collective search 

The collective search, as illustrated in Fig. II, is the de&ult search according to the present 
invention and is used when the user does not actively choose on of tiie other search options. 

Upon entry of a keyword in step 402, that keyword is used to select from a combination of web page 
selections associated with that keyword. As shown, for example, in step 404, an equally weighted 
combination of conventional, popular, highflier, new and past search results is used to obtain a list of 
web page numbers. Thereafter, in step 406 the list of web-page numbers found from step 404 is 
combined with the URL address and web-page description from Table 2 (188 Fig 5). In step 408 the 
resulting list of web pages is then tagged^ depending on the results of step 246 in Fig S as described 
previously, and sent to the user for ihem to make their selections, the system is first configured, the 
search engine 10 database will not posses any information on popular, high flyers and new web page hit- 
lists, so search results will initially be obtained from the conventional hit-list (normal search engine), and 
the tagged web pages dien used to create the database sets as have been described. As the system 
develops, the data sets associated with each of the other search types will become populated, and 
searches using the odier search types will become more useful. 

Date created search. 

Fig 12 illustrates a date created search that allows the user to select the date that the web-page was 
submitted. This feature wiU only work for web-pages that contain a date created data entry, identified as 
date-time submission 74 in Fig. 4. Upon entiy of a date-time and/or a keyword in step 432, the search 
engine 10 will perform step 434 in which a list of web page numbers associated with these variables is 
obtained. Thereafter, in step 436 the list of web-page numbers found from step 404 is combined with the 
URL address and web-page description from Table 2 (188 Fig 5). In step 438 the resulting list of web 
pages is &en tagged, depending on the results of step 246 in Fig 5 as described previously, and sent to the 
user for tiiem to make their selections. 

Customized search 

Fig 13 illustrates a customized search that allows the user to decide how they want their default hit- 
list to appear. In step 462, the keyword and User ID is selected in order to initiate the customized search. 
Prior to initiating the customized search in step 466, which step is identical to step 404 of the collective 
search deviously described widi respect to Fig. 11, however, step 464 is applied to customize tiie users 
de&ult mixture of hit-lists For example a user may want their de&ult search results to include only 
popular and new web pages but no high flying web pages. This custom search is then performed in step 
466 to generate a list of web page numbers. Thereafter, in step 468 the list of web-page numbers found 
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from step 466 is combined with the URL address and web-page description from Table 2 (188 Fig 5), In 
step 470 the resulting list of web pages is then tagged, depending on the results of step 246 in Fig 5 as 
described previously, and sent to the user for them to make their selections, one preferred embodiment, 
the make-up of the default search results list can be amended by •learning' from the user's behavior to 
5 create a changing customized search based on the user's own search patterns. If a user consistently 
chooses new web pages or high-flying web pages for example, then their set of default search results will 
be changed to reflect their nonnal search style. 

Magazine search. 

The magazine search according to the present invention enables users to search by following a 
10 series of menu-driven subject choices (or similar hierarchical structure), rather than entering a specific 
key-word(s). 

Existing magazine-style search engines require editors to set the structure of information, decide on 
its relevant merits and set the criteria, such as price, for space on a given page transmitted to the 
user/viewer. Using the search system of the present invention, the users* themselves dynamically decide 
15 what is and is not worth seeing. Thus, although editorial input is needed regarding a hierarchy of 
subjects, the web-pages that emerge as the most popular for each of these subjects will evolve 
automatically. 

Use of data sets for different groups of people 

Different popular hit-lists may be employed to provide results which would reflect different cultural, 
20 geographical, professional, gender or age interests. Thus, as shown in Fig. 14, when a user enters a 
keyword and User ID in step 490, the defeult profile of the user can be used to reflect the type of web 
pages that people of the same "group" as the user profiles desire to see. Hius, the search that takes place 
in step 494 is based on the subscripted X,Y and Z values obtained fix>m the default profile of people of 
those "group" affiliations identified in the user's personal profile obtained in step 492. Thus, therather 
25 than an overall global search result, search results are obtained particularized for the group that the user 
identifies with. The resulting list of web pages, derived from steps 496 and 498, as have been previously 
described, are particularized for that group. 

Thus, for a particular user with the profile type New Zealand selected as a geographical factor, a 
search for team field sports and related key-words, rugby material migjit figure prominently, whereas an 
30 American profile type may produce a bias towards basebaiyAraerican foo&all material, for exan^le. 
This technique offers the ability to discriminate between the different meanings of the same words, 
according to ±e context of the popular hit-list associated with a particular profile type. A general search 
using a key-word 'accommodation' for example would include results related to housing, renting and 
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similar, whereas if the user indicated an interest in optometry in their profile type , then the tism 
'accommodation' would be interpreted quite differently. 

The relevance of such sites will evolve automatically, without any active evaluation oif the sites by 
the search engine operator or the user. There are no complex algorithms required to analyze the 

5 relevance of web-sites for particular types of users. Instead, the type of site deemed relevant will be 
decided by those users selecting those characteristics for their profile type , i.e. American females 
interested in rock-climbing. Sites of greater relevance will naturally attract more hits, increasing their 
ranking and thus increasing the chance of a subsequent user also investigatmg the site. In the above 
example, any web sites listed for the keyword •accommodation' which were unrelated to optometry, 

10 sight, lens, vision, etc., would not be accessed for the poiod of time required to make a valid hit It 
would therefore receive a very low ranking and hence be even less likely to be accessed by further users. 

The user can select different profile types for different searches during a single session and is not be 
restricted to the default profile types. 

In a further embodiment of the invention, there can be included a level of authentication for person's 
IS of a certain group to have their search results actually be used for purpose of updating the database 
relating to that group. For example, doctors who have a user ID that identifies them as doctors may 
perform a search related to a certain medical condition, and their selections can be tagged and used in the 
database for that group of doctors as has been previously described. However, although patient's may 
desire to identify their profile with that of the same group of doctors, their selections are not as 
20 significant as those of the actual doctors, and thus while they are able to view the web page listings that 
doctors deem most pertinent, their selections are not used to update the doctor's group database, since 
their IDs do not identify them as a doctor 

Limiting search options 

Another feature of die present invention is keyword eliminator feature, which is illustrated in Fig. 

25 IS, and prevents certain users, such as children, from searching for undesirable keywords and web-pages 
when the keyword eliminator feature is turned on. The present inventor's have realized that it is 
potentially much easier for example, to stop children searching for pornography, rather than attempting to 
trace and prevent access to all sites on the Intcmet with pornographic content. This would be used as a 
complimentary tool to existing "net nanny" type devices. Thus, as shown in Fig. 15, with the keyword 

30 eliminator turned on, a preexisting table faiaccessible keywords is stored in a table and compared m step 
522 with a keyword previously entered, as shown by step 520. Thus, keywords that are inaccessible will 
not be searched. Thus, for example, parents could choose the types of keywords 552 that they do not 
want their children to search for - and this will be different for different sets of parents. The system 
filters out the keywords that may be used for subsequent searching in step 524. 
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Determining ivhich users to sample 

Fig 16 illustrates die process of detennining which search results should be sampled and used to 
make up the cumulative surfer trace table 170 of fig. 4, also referred to as Table 4, While possible, it is 
not necessary to collect data concerning every single search, and this can be controlled by determining 
5 which sets of results get sent out with 'lagged" web pages. Reference with respect to this was aU^ady 
mentioned with respect to authenticating user's of a particular group, doctors in the example provided. 

As shown in Fig. 16, after entry of keywords and other data in step 554, there are three decisions 
that determine whether results are actually "tagged" as has been previously described in step 118 of Fig, 
3. 

10 As shown by step 556, for a user that has a user ID and has chosen to use the personal links table 

174 of Fig. 5 (Table 6) as previously described, it is necessary to "tag" all of their results so that all of 
their past preferences are recorded in their personal links tablel74. The search engine according to the 
present invention system can update the user*s personal preferences but not update Table 3 if certain 
security levels have not been satisfied (see below). If, however, the personal link table 174 is stored on 

15 an individual's computer rather than at central location there is no need to send out tagged results as the 
data is stored locally. 

As shown by step 558, when a keyword is submitted, a check is made that the IP address 62 has riot 
already searched the keyword using security table 168 (Table 7) before the user is sent a set of tagged 
results. If so, the user can still undertake the search though it will not contribute to the cumulative surfer 
20 trace 170 (Table 4). This allows all normal users to afiect the popular hit-list and all users to search 
whatever they would like, but prevents fi^udulent users, such as spammers, from contributing to the 
popular hit-list. The security table 168 can also include information on links between keywords 52 and a 
user ID S6 to detect repeat searching. 

While it is possible for user's to change the IP address of their computer, this is also detectable and 
25 preventable by a number of methods such registoing and tracking the use of IP numbers. 

Other methods to exclude false searches include: 

• Only creating a surfer trace for users with a user ID 554 recorded with the search engine. 

• Extending the time limit requited to make a visit count as a useful hit 

• Do not count single visits to a URL from a keyword (for which there is no means of measuring a 
30 lapsed-time). 

As shown by step 560, popular keywords can be traced once every tenth, hundredth, or even 
thousandth occurrence, and the frequency of this selection can be changed to optimize the system. The 
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frequency of keyword usage is determined from keyword table 164 as shown in Fig. 5 (Table 1). The 
frequency of sending out tagged results can also be linked to the rate at which popularity is changing for 
different key words. For example the keyword •'IBM" would probably have IBM's home page at the top 
andmost user's would go there, whereas the key word **Iatest fiids'* may have a constandy changes set of 
5 web pages that needs to be sanqjled more frequently. 

To avoid the keyword URL link table 172 of fig. 5 (Table 3) from becoming unduly large, one method 
is to only register keywords in Table 3 once they reach a certain frequency of usage. This is controlled by 
not sending out tagged results for less frequendy used keywords (found from Table 1). 

Active suggestion of web pages to visit 

10 Another feature of die present is illustrated by Fig 17, and involves using data to actively suggest 

web pages. This is different from a search because the user sets up the request and is mformed if there is 
any new data on the subject To do this the users has to actively specify which keywords ftey are 
interested in and the profile type that they would like to act as a filter or agent and the search type (new, 
highflying, popular) in step 588. This information is stored in the user's profile ID 166 shown in Fig. S 

15 (Tables). 

Thus, at various interval's the user receives a list of suggested web pages determined by a group of 
like minded humans. For example a user may choose to be notified of web pages with the following 

Keyword 582 profile type (agent 588)Search ^e 586 

Rugby New Zealand, Male highflying 

20 Decay treatments Dentist new 

This way if there are highflying web pages on "rugby" that other New Zealand males found useful 
(i.e. they spent a significant amount of time looking at the information - high rate of change of X m 
Table 3) the user would be notified. Similarly if there was any new infonnation on "decay treatments" 
submitted for dentists to look at, die user would be identified about it (value of Z in Table 3). It is 
25 unlikely tiiat a computer agent will ever be as good at filtering infonnation as a selected group of peers. 
An advantage of this system compare to other "agent type" software is that diis does not require any 
software on the user's computer. It is all included as a natural extension to die other search engine data 
sets. 

The suggested web-sites can be displayed for the user when they next access the search engine or 
30 they may choose to be notified of diese suggested web pages via e-mail notification. This way web 
pages can be drawn to the user's attention widiout any active searching for these keywords. 
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Passive suggestion of web pages to visit 

Another feature of the present is illustrated by Fig 18, and involves automatic web-page suggestion 
based on how &e user has searched m die past and requires no active input from tiie user. 

As shown, in step 620, upon the entry of a user ID, the system can be activated passively, at various 
5 intervals or times (such as at each login to the search engine), by looking at which keywords, profile 
types and search types, the users frequently looks at using the personal links table 174 of Fig. 5 (Table 
6). For example, it may be that the user frequently looks at Rugby infonnation as a "New Zealand, male" 
and looks at decay treatments as a "dentist". This information can be found from the automatic book 
marking table, previously referred to personal links table 174. If the user has not looked at these sutgects 
10 for a certain length of time and there arc new or higjiflying information sources, the user will be 
automatically notified of these new infonnation sources. 

In a modification of fliis embodiment, a periodic e-mail can be sent out with the two newest and 
highest flying sites related to the key-words of the user. 

Determining a list of suggested keywords 

15 A problem with Internet searching for many users is knowing which key-word to use for searching- 

While the present invention could be implemented with an infinite number of keywords, too many k^- 
words (includes phrases) that users choose can be problematic. 

Accordingly, as shown in Fig. 19, the present invention also provides for a data set 642 that provides 
synonyms for the keywords entered along with the particular profile type in step 640. The system 

20 represented in Fig 19 is referred to as a key word suggester. This is implemented, m one embodiment, 
by matching the key-word entered by the user in step 640 with the existing key-words and phrases in 
keyword table 164 of Fig. 5 (Table 1) that other users have tried using other search methods, identified in 
step 646. Each keyword is then tagged in step 660, and those that are selected by a user in step 662 are 
used to form a keyword surfer trace 648 as shown in Fig. 19, which contains the original keyword 52 diat 

25 the user entered, the keyword selected 652, and the IP address 130. usct ID 128 and date-time 132 data as 
in the previously described web page surfer trace. 

The data from the cumulative keyword surfer trace 648 is then used to reinforce links between 
keywords. In this way the system learns which keywords are associated with each other. The system 
learns which words are related to each other in the same way that the system learns which URL's are 
30 associated with the key-words. The lists of suggested keywords will become more relevant over time as 
the relevancy is improved each time the keyword suggester is used. 

Creating data sets that determine the suggested keywords 
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As shown in Fig. 20, a keyword link table 696 and a cumulative keyword trace table 698 are used 
along with the previously described security table 168 to create the data sets for suggested keywords. 
The key-word link table 696, shown in Table 10 below, records how often each key-word is selected from 
the suggested key-word list This can then be used to rank the of the usefulness of different key-words 
5 relative to each other. 





Key-word 1 


Key-word 2 


Key-word 3 


Key-word 4 


Key-word 5 


Key-word 1 




5 








Key-word 2 


20 




1134 






Key-word 3 




356 








Key-word 4 












Key-word 5 




20 








Key-word 6 






3 






Key-word 7 










168 



Table 10 Keyword link Table 



It can be seen from the Table 10 that people who entered key- word 2 found key-word 3 the most useful 
followed by keyword 5 then key-word 1. The keywords can have a directional aspect, for example, 
keyword 3 was found useful 1 134 times after trying keyword 2. However keyword 2 was found useful 
10 only 356 times after users tried key-word 3. 

Information about the links between keywords in Table 10 is updated by the information about how 
people are using suggested keywords (keyword surfer traces 648). The cumulative keyword surfer trace 
698 is the combined information fit)m all individual keyword surfer traces 648 and it is \ised to determine 
how many *'hits" (significant visits) each keyword had for each key-word 

15 The information collected from each individual surfer trace is a series of inputs become a cumulative 
keyword surfer trace, shown in table form below in Table 1 1 . 



IF Number 


User ID 


Keyword 
(original) 


Keyword 
(suggested) 


Date-time 























Table 11 Keyword cumulative surfer trace 
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Populating the keyword link Table 

Fig 20 also illustrates how links between keywords in Table 10 can be initiated by recording 
sequences of keywords that users put into the search engine. If, for example someone searches using flie 
keyword •'NHL" and then **National Hockey League", this would then draw an association between feese 
5 two key-words in Table 10 by recording this as one hit. Again this captures the reasoning power of 
users to define the link between two keywords. Often the keyword in sequence will be totally unrelated to 
the previous key-word but sometimes it will be relevanL If the next user chooses it from the key word 
selector it will reinforce the key-word link in the same way that repeat selection to web pages reinforces 
links between a keyword and a URL. 

10 The following is an example of keywords that may be suggested after entering the a simple key-word 
like "Book" 

• book sales ^ 

• book reviews 

• specialist books 

15 • second hand books 

• used books 

• special edition books 

All of these key-words (phrases) would come from information seekers (users) and information providers 
(web-page developers). The most appropriate keywords will emerge naturally over time. 

20 All keywords used by users are entered into flie key-word link table 696 of Fig. 20. Thus, if people 

enter an uncommon keyword such as "cassetes*' instead of "cassettes" the key-word suggestcr will 
suggest that the user tries "cassettes". There is therefore, no need to create a set of URL-keyword links in 
Table 3 for "cassetes" Thus saving on data space and there is also no need to send a tagged set of results 
for the keyword "cassetes". Hence Actc will be less data sent back to the search engine. 

25 It is also a contemplated embodiment to run the keyword suggester like Table 3 and have high flying 

keyword associations and new keyword associations so the system can learn how keyword associations 
change over time. For example, the keyword suggester trace can store die most recent keyword links and 
modify the main key-word trace by a history factor, in the same way as Table 3 is modified by the 
cumulative surfer trace. 

30 The cumulative keyword surfer trace 698 is processed in the same way as the cumulative web-page 

surfer trace 170 of Fig 5 to refaiforce links between keywords in the keyword link table 696 (Table 10). 
A time variable can also be included so that if a user chooses another keyword very quickly it is assumed 
that the previous keyword was not usefiil and is not counted as a keyword surfer trace. 
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Also, the individual keyword suggester can store, for each user, their personal keyword links. 
Further, the keyword suggester can be based on a number of diflferent profile types. The word 
associations may be quite different for people of different culture, nationality, occupation and age etc. 
Different keyword suggesters can capture the kQ^-word association of different groups of people. The 
5 keyword hits in Table 10 can be subscripted in the same way that the values of X,Y and Z are subscripted 
for different types of profiles in Table 3, as explained previously. 

Using the Tables to create a list of suggested keywords 

Fig. 2 1 illustrates a variety of manners in which a list of suggested keywords can be created. 

One manner is by ranking the values of X in the keyword link table 696 (Table 10). This ranked 

10 list of keywords is combined with keywords fi-om a normal search of keywords, described previously 
with respect to step 646 of Fig. 19. nother manner of suggesting keywords, shown as step 730, is to 
compare the popular list (URLs X values) for the user.entered key.word with the popular-list of other 
key-words in Table 3. A similarity pattern X values in Table 3 indicates that these keywords are similar. 
For example a user may search for "film reviews** and the keyword suggester may come up with "movie 

15 reviews" which has a more comprehensively searched list of sites. In this case there is no physical 
similarity between the words movie and film, but they are linked by the similarity of the patterns of 
URLs links they have in common in Table 3. 

The usefulness of the key word suggester list is enhanced indicated by step 744, by associating with 
each key-word on the suggestion list an indication of whether there are any of the aforementioned 
20 searches available (popular, high flyer, etc.) for that key-word in keyword URL links table 172 of Fig, 5 
(Table 3). The keywords with the most search results are then highlighted. 

Decision to send out tagged keyword suggestions list. 

The security table 168 and keyword link table 696 are used to determine which keyword links to 
sample in a manner similar to that previously described with respect to tagging web pages. As with the 
25 decision for tagging web pages this can depend on whether it is a repeat keyword (found from security 
table 168) and on the frequency of keyword usage (found from keyword table 164), as well as the 
considerations previously discussed. 

Determining other content 

When searching on tiie Internet, various different web pages listings and web pages are displayed as 
30 has been described. One common characteristic of each these different web page listings that have been 
described is that when they are displayed they appear substaiitially identical to one another. As shown in 
Fig. 25, each of the different listings 900, though the text may be different, is otherwise visually identical. 
Other listings 902, however, are many times larger than the listings 900, may include graphical content. 
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and appear more prominent when displayed to the user. Such listings can contain the same content as a 
web page listing, or other content, such as advertisements, pictures, editorials and the like. 

This other content may be displayed to a particular user based upon key-words, user profile type 
(nationality, age .gender, occupation, and so forth) and the time of the day, for example. 

5 In many instances, this content that is displayed along with web page listings is inserted into the 

display area using mechanisms that are diflfercnt from the searching system described previously with 
respect to conventional search engines. Ite mechanism by which this content is displayed m large 
measure based upon some other criteria, such as payment for the space that is used. While the system for 
selecting diis content works, it is difficult to keep track of which content was displayed when, especially 

10 if that content is frequently changed. Thus, another aspect of the present invention, which will now be 
discussed, is a system for tracking changing content, and allowing for content providers to dynamically 
select when their content will be displayed. This dynamic selectable content, as illustrated in Fig. 22, 
may be displayed to the viewer based upon keyword or profile type as entered by the viewer in step 762 
as shown. Within the content selector step 764 that tiien follows, the time of the day is considered and 

15 used in selecting the appropriate content 902 as illustrated in Fig. 25 along with the web page listings 
900. Each content 902 transmitted with the search results made up of web page listings 900 is tagged in 
step 766. Thus, if a user in step 768 selects that content 902, the results of that selection is fed back to 
flie content selector 764 so that the content database associated therewith, can be updated as surfer trace 
data in a manner such as has been previously described. Thereafter, in step 770, that content 902 is 

20 displayed, typically simultaneously with content 900 

In addition to the surfer trace data being input as has been previously described, this content 
embedimenfalso provides for the web page developer, or content provider, to determine the frequency" 
with which this content will be reviewed, and, depending upon the patterns of users with respect to web 
page listings that are viewed, alter the manner in which the content provider's content 902 is displayed 

25 based upon key words, user profile and the like. In order to implement this dynamic content flexibility, 
there arc diree additional data tables, illustrated in Fig. 23, which are used to track the changing content 
902. These tables are keyword content data table 804, personal profile content data table 806; and 
content provider data table 8 12. Keyword content data table 804 is illustrated in more detail in Table 12 
below, and its characteristics are: 

30 H is the cumulative number of hits for one time period for the keyword. This is the number of times 
people choose that keyword; 

• N is the number of times particular content 900 that is associated witha keyword has been sent out for 
display. This is not necessarily the same as H since content associated with a profile type may be have 
a different selection fector than content associated with the keyword. This selection factor can be 
35 various variables, such as votes or price; 
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• A is the selection factor for the keyword from each content provider (e.g. a selection factor could be a 
S bid to be associated with tfiat keyword); 

• T is the total of the selection factors for each keyword and is the sum of A*s; and 

• P is the content value, as determined by votes or price, for each keyword and is T/N (e.g. tiiis could be 
S the S per time content is sent out with that key word- this is a price of being associated with that key 

word) 



Keyword 


Cumulative 
hits for one 
month (H) 


Amount of 
Content 
sent out (N) 


Content 
Provider 1 

(Al) 


Content 
Provide 2 

(A2) 


Total 
(T) 


(P) 


Books 














Fish 















Table 12Keyword content data sets 



This Tkble can also include the maximum content value M that the content provider is prepared to give. 
10 There is no limit to the number of content providers tiiat may attempt to have content 902 displayed with 
a web page listing that is associated with a particular keyword. 

It is possible to have a separate Table 12 for each country or area, so that the content value per 
country or area, per keyword could be different. In addition there could be different content values for 
1 5 different time periods in each country or area. 

It is possible that provider's of content 902 could target both the key-word and the audience by 
identi^ng each of the keywords with target audiences, e.g. the number of hits associated with the word 
rugby could be broken down into the di£ferent profile type s that search for the word rugby. The 
cumulative number of searches for rugby could be 6000 split into 520 under 2rs and 4000 21-50 year 
20 olds and 520 50+ age group. Thus, there may be a dififerent content value for each of these sub classes 
wifliin a keyword search. 

In addition to the key-word dataset 804 it is possible to have a data set of the following type for 
different profile types 806. It contains the same entries for each profile type, instead of keyword as 
described above with respect to the keyword content data table 804 of Fig. 23. 
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Profile type 


cumulative 
hits for one 
monm \ti) 


Amount of 
Content sent 

OUi 

(N) 


Content 
Provider 1 

(Al) 


Content 
Provider 2 

(A2) 


Total 
(J) 


(P) 


Male 














Female 














Professional etc 














Undefined profile 















Table 13 Personal profile content Table 



Table 13 determines the content value of the content 902 to specific audiences of people as opposed to 
different keywords and allows for targeting of specific audiences. 

5 It is within the scope of the present invention to include combination profile types in Table 13 as well, 
such as male, professional or New Zealand, females. The content value for the combined profiles will be 
different than the content value of individual profiles. The mechanics involved in determining the content 
value and choosing the content 902 will be the same, and described further hereinafter. 

0 Content provider data table 812 of Fig. 23 is illustrated in more detail below as Table 14 and 
contains information about fce.contertiLi!ff^^ infbrmation- 

such as the Bitmap (HTML or Java applet or similar) that the content 902 will use and a unique number 

to identify each different item of content 902. 



Name 


Address etc 


Content Information 


Unique number for each Content 


E.g. John 






Content, no. 








Content no. 



Table 14 

15 

This Table may also store details of the content provider, such as passwords, payment details (e.g. credit 
card number and authorization), content delivery (number of times content has been sent to users) etc. 

The data sets for the above mentioned content tables are populated as follows. For the keyword content 
data table 804 
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• H, the cumulative number of hits for a particular key word for one time period, is taten directly &am 
Table 1 (800). 

• N is the number of times content is sent out associated with the k^rword. This is mcrcmented each 
tiine an item of content 902 is displayed to a user that is specifically associated with fiiat keyword 

5 810. 

• The values for A 802 are selected by content providers for each keyword The content provider can 
also enter a maximum value M over which they will no longer select to be sent out with the keyword. 

• T is the total for each keyword and is the sum of As 

P is the content value, as determined by votes or price, for each keyword and is T/M 
10 Populating the personal profile content data 

• H is the cumulative number of hits for each profile type and this information is taken directly fiom 
Table 1 (sum of the mdexed W's). 

• N is the number of items of content 902 sent out associated with the personal profile. This is 
incremented each time an item of content 902 is sent out that is specifically associated with that 

IS profile type 810 

• The values for A 808 are placed, through an entry process akin to bidding, for each profile type. The 
content provider can also enter a maximum M they are prepared to pay, or vote, as the case may be. 

• T is the total for each profile type, and is the sum of As. 

P4s-Aecontent-valuc-foreach-profile-and4s^yN 

20 Populating the content provider's details table 

The majority of the content provider's details 812 are electronically entered by the content 
providers. Each time a content provider's content 902 is sent out fliis event is also recorded in the 
content provider's details Table 812. This will also record the number of click-tooughs 
(820,822,824,826,828) and the cost, in terms of payment or votes, of the content 902. This will form the 

25 basis of the electronic bill or tabulation that is thereafter forwarded to tiie content provider. 
How the data sets are used to select content sent out to users 

In file discussion that follows, with reference to Fig. 24, it is assumed fliat only one banner of 
content 902 is transmitted with each set of web page search results 900. The same algorithms apply if 
there are multiple sets of content transmitted with each set of web page results. 

30 A keyword and profile type are submitted to the search engine in step 852. From keyword content 

data table 804, personal profile content data table 806, fee value of content 902 for each is found from Ihe 
value of P in the Tables. The highest value of P for the keyword or profile type, determined in step 862, 
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detennines the type of content (keyword or profile type) that is transmitted along with the web page 
listings 900. It may be that there is no specific value for the keyword and the user may not be using a 
specific profile type. In this case the values for unassigned content items will be used (from Table 13 for 
users without a profile). Choosing which specific content item 902 is sent out is discussed below. The 
5 details for the content item (their graphics, text, associated programs, etc) are taken from Table 14, 
content provider details table 814 and transmitted to the user in step 868. Details of the content items 902 
transmitted for each content provider are also sent to the content provider, as shown by step 870, at 
regular intervals. 

Determining whether it is keyword or profile content that is transmitted 

10 The type of content 902 transmitted is dependent upon whether it is a key word based content or 

profile option based content. For example a Male from the US may search for fish. The value applicable 
to this search is, keyword = fish, profile = male, profile = US, profile = US, male. When deciding which 
content gets displayed, the system compares the value of the content for all the possibilities (keyword, 
combinations of profile types) and sends out the content that has the most value, as determined in step 

15 862. For example an under 21 male may search using the key-word "Rugby** and the value for the 
associated content for Rugby is O.l per view, whereas the value per view for targeting an under 21 male 
is 0.2 and thus the content targeted at the male under 21 would be displayed rather than the rugby 
content It is important to note that the cumulative frequency of times that content items 902 are 
transmitted (N) will be different to the total cumulative frequency for the targeted area (H). In this 

20 example the cumulative firequency (H) of the number of times 'rugby' is searched for and 'males under 
21' would both incremented by one (via Table 1). However, the number of times an item of content 902 

is displayed would be incremented only for the 'male under 21 ' Table (thi;; is the figure used to 

determine the value of die content per unit view. 

Determining which specific content is transmitted 
25 The example below shows how content associated with flie keyword is selected. It is the same 

process for content associated with profile types. 
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10 


0.52 
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For the key-word "book" the content 902 of content provider 2 would be displayed whenever the 
keyword was searched, as they are the only content provider associated with that key-word. However, for 
the key-word "fish", content providers 1 and 2 would have their content sent out the same number of 
times. In the system scaled to the levels at which it is intended to be used, there will be a very large 
5 number of content providers bidding for different keywords and profile types, 

Calcalatti^ the value of content 

If fliere is a new content provider who, for flie keyword •'bool^" values fiie content at, for instance, $5 
per month, this wiU change the value to 0.075 and fliis will mean that the total associated with the word 
book is $15. Therefore, content provider 2 would now get transmitted 66% of the time (10/15) and the 
10 new content provider would be displayed 33% of the time. The proportion of time ancontent provider's 
content is transmitted is A/T. 

How content provider's nse the data tables 

When bidding for content 902, content providers select a keyword or profile to target their content 
fi-om Tables 12 & 13. The search engine indicates automatically the number of times this search has been 
15 performed for the previous time period (H), the number of times items of content were sent out 
associated with that selection (N) and the value of the content P. 

The new content provider then enters the selection factor A and the system can then instantly 
calculate the new value (P) based on the new total bids (T). The advertiser can also be told the number of 
views per month they are likely to get for their bid (N*(A/T)). These changes are calculated in real-time 

20 to give the new content provider an indication of how their bid will influence the value and the views 
they will rcceiv^foFtfieFbid. If a value andliiSibS of vie^ are agreeablelo the advertiser they can 
choose to submit it as a bid for the defined period, such as a day, week, or month, for instance. The 
details of other content providers are, preferably, not made public. Content providers may also enter a 
maximum value M feey can part with for their content This jMTovides content providers with some 

25 security against paying too much if fee value changes. If the value goes too high then a content 
provider's bid can drop off the list (if P is greater than M ften A is not counted as a bid for that particular 
content provider). The bid would go back on the list if the value went down agam, thus acting as a 
stabilizing mechanism. The content provider can, in a preferred embodiment, be notified by e-mail if 
their content 902 has dropped off the list due to their value limit M . 

30 As shown by the content provider details table 812 of Fig. 24, for instance, content providers thus 

have an account with flic search engine proprietors and procedures for debiting theur account for their 
content is automatically calculated from the account details on a periodic basis.. An electronic statement 
of the number of views, cost per view, number of click-throughs and cost per click-through for each 
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content provider is also forwarded to each content provider, since this information is also stored in 
content provider details table 812 (Table 14). 

In a preferred embodiment, it is possible to identify clusters of similar keywords based on the 
keyword link table. The reason for identifying clusters of keywords is so that content 902 can be targeted 
5 at gsovps of words rather than just individual words. The cluster for the key-word "car" may mclude 
hundreds or thousands of words that have links to the word car (e.g. convertibles, automobiles, vans). 
Statistical clustering techniques are used to define the size and frequency of key-word clusters. This 
makes it a much more automatic process than an editor deciding on clusters of keywords for content 
provider's to target 

10 The same system can be used to set values for keyword clusters. While grouping words in this way 

would incur an increased administration cost, it is nevertheless computationallysimilar and only initiated 
once a certain level of hits on a keyword had been exceeded . 

Ckmtent only search Users can also purposely choose to search only the content provider associated 
with a keyword. In this case the search results will be based on the values of A in Table 12. The content 
15 providers that pay the most will be at the top of the list 

The key-word suggester can also help content providers choose key-words or sets of key-words that 
they would like to display. 

Controlling the search engine system There are a number of parameters that can change the way in 
20 which the search engine according to the present invention ranks web pages. These factors (described in 
detail-below)-are: 



• History factor 

This determines tiie rate of decay of the existing popular lists (popular hit list) as descnT)ed in the 
text previously. This is a number between 1 and 0. A high history fector will make it difficult to 
25 change the existing popularity lists. As an example if the rate of searching for a particular keyword 

is increasing quickly, then the history factor should be lower to enable emerging web pages to rise 
up the popularity list 

• Frequency of updating Table 3 from the cumulative surfer trace 

This is a measure of the frequency with which the popularity lists are updated with information 
30 about the users' activities (i.e. the surfer trace), for example, this may be measured once a day or 

even once a month depending on the rate of change of popularity of particular keyword searches. 

• Sampling frequency 
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This is the frequency of sampling the information of how users are searchmg. If it is a common 
teyword it is not necessaiy to monitor every search. It may be that only a percentage of all 
searches need be monitored to accurately determine web-page popularity. 

• The composition of the default search list (mix of results from the new web-page list, high-flyers 
5 and popular-lists etc.) 

The mix of web pages presented to the user as a default can be changed if necessary to reflect the 
way in which search results evolve over time. 

• Content ^hit&ctor* 

The "content hit fector" is a measure of the weighting given to a hit oncontent being recorded as a 
10 hit for a keyword. The default setting is feat a hit on content counts the same as a hit from the list 

of web pages. The value of content hits can be set higher or lower than unity, depending on the 
price of fee content, e.g. the "content hit factor** may need to be increased for valuable keywords 
as this would decrease the ability to spam these commercially valuable keywords. The higher fee 
content factor, fee higher the resistance to spam as fee search results would be more dependent on 
15 price rafeer than popularity. 

• The time period for content bidding 

Content providers • bid a certain amount for a particular time period e.g. one monfe. This time 
period may be different depending on fee rate-of-change of fee price. If the price is changing 
rapidly or is very stable, fee time period may be respectively shortened or lengfeened 
20 correspondingly. 

• Number of key-words per web-page submission 

This number could be changed to influence how fee system learns from new web pages 
submissions. 

• Lengfe of time between accepting new-web-page submissions 

25 If the date of submission for a web-page is too close to fee existing submission for that web-page, 

feen it is not accepted This lengfe of time can be changed depending on any of fee above factors 

• Number of searches per day, per person (IP address or user ID) feat count as valid hits 
This number can be changed to reduce the possibility of spamming 

• Lengfe of time before renewing fee security Table 
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The security Table that restricts abuse, notes the links between keywords and IP addresses of user 
identifications. The length of time between refreshing this Table can be changed to make it harder 
to spam the system. 

The settings for these factors can be different for different keywords or groups of people depending on: 
5 • Frequency with which searches are done 

• The rate-of-change of frequency of searches 

• The price of the content 

• The rate of change of price of content 

The precise setting of each of these factors will not be known until the system begins operation 
10 ^learning' about the users behaviors. The optimum settings for different situations may be determined by 
experimentation. 

Other appllcationsThough the preferred embodiment has been described with reference to a software 
useable on a computer network for searching the Internet, it will be appreciated that the mvention may be 
readily applied to any search system where a human user chooses results from a set of initial search 
15 results. Such a system may for example be part of an. a LAN or WAN or even a database on an 
individual PC. 

Examples of other possible areas of application for the present invention are described below. 
Intranet searches and other data base searches 

Intranet searches at present suffer from similar drawbacks from Intemet searches,, indeed some 
20 intranets can in themselves be extremely substantial systems, in which identifying a particular 
information source or item can be equally problematic. Utilizing the present invention in such 
applications is within the intended scope of the present invention . 

Searching other media forms 

The present invention is also intended to be applied matching a user's profile to other media sources 
25 (such as pay per-view, television, videos, music and the like), thus allowing content targeted to a 
particular audience. The same form of search lists as described above (Popular-list, High-flyers, Hot-off 
the press, etc) may be employed to direct users to appropriate material. 

Shopping 

The search techniques described herein can be implemented in a consumer network to assist 
30 shoppers in selecting items from within one shop or among a large number of shops. Instead of using a 
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keyword-URL link Table, there would be used a keyword-item purchased link Table, that then records 
what items were purchased after each shopping request (key-word). This embodiment also records where 
the user purchased the product. Each time a shopper purchased an item this would increment the 
popularity of that item, using the same techniques described previously. 

5 The profile type s in this embodiment can be used to record the types of purchases made by different 

sets of people. One could, for example, select a profile type and see what are the most commonly 
purchased items for a range of users, and would provide assistance in choosing gifts for people who have 
a different profile type than yourself 

Scientific publications 

10 Searching scientific data bases (on-line papers, journals, etc.) with the present invention will 

dramatically reduce the time spent examining obscure, or esoteric areas only to find ttic information 
irrelevant. The criteria for a valid hit for such uses would typically incorporate the extended timefeature 
described above to establish the usefiilness of the information source. The refereeing and referencing of 
academic/scientific papers using the present invention could enhanced by classifymg different levels or 

15 types of user, e.g. Dr, Professor etc. postgraduate, and so forth. This will enable users to see. for 
example, what information sources the eminent authorities in a particular field found of interest. It would 
also allows the authors of a paper to become aware of how often their publication was accessed and 
possibly fiirther indicate where and how often the paper was used as a reference in subsequent papers. 
Users may have to formally register with different organizations to obtain levels of ability to referee. 

20 Users may also choose the level of refereeing for flieir searching. 

Online help 

There is currently a substantial global requirement for on-line help and support, particularly for 
computer/software applications. Such a need would be considerably assuaged by use of the present 
invention as the software developers obtain a direct feedback to the type and frequency of particular 
25 inquiries, whilst the users receive the accumulated benefit of the previous users. Different profile type s 
would enable the answers to be provided in an appropriate form for the user , e.g. novice, expert, etc. 
The keyword suggester may, for example, suggest searching with key-words (questions) more likely to 
yield a satisfactory response. There can be a range of answers to each question and as the system learns it 
will converge on to the best answers, 

30 Qaestion and answer services 

Current On-line question/answer programs could be configured to run via flie present invention, thus 
enabling answers to repeatedly asked questions to be based on previous questions and similar questions 
to be suggested. 
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Content optimization on otiier parts of the internet 

The same content bidding mechanism could be used to determine the price of content for any 
location on the Internet, not just web page listings as identified above. In this embodiment, content 
providers will bid for a general content space to set the price automatically. 

5 The profile type information from the search engine could be used as a passport so that other 

advertisements on the Internet could be more targeted to different audiences. This profile type 
information could also be used by web-page developers to customize their web-page for different sets of 
users. 

People matching service 

10 In another embodiment, the system according to the present invention can be used as a dating 

service and/or a method for matching people with similar preferences by doing a statistical analysis to 
compare the individual preferences (Table 6) of groups of users. The individual past preference Tables, 
in this embodiment, would preferably be normalized and compared to each other using a standard 
correlation coefficient When compared to other users it would give a numerical indication of how 

1 5 similar their preferences are. 

The same embodiment could also be iised to find information about similar people from there past 
preferences Tables. For example one could ask to be give the names of people in New Zealand with an 
interest in Ecological Economics and a search could be made of the personal preferences Tables. Such an 
embodiment, however, would typically include a password/consent indicator that provides consent of 
20 identified persons to give out their information, which consent could be given, for example, in only 
certain circumstances, which circumstances are limited to searchers who have a level of authority and 
password indicating the same, or for persons who identify themselves with certain characteristics. 

While the invention has been described in connection with what is presently considered to be the 
most practical and prefenred embodiments, it is understood that the invention is not limited to the 
25 disclosed embodiment. For example, each of the features described above can be use singly or in 
combination, as set forth below in the claims, without other features described above which are 
patentably significant by themselves. Accordingly, the present invention is intended to cover various 
modifications and equivalent anangemcnts included within the spirit and scope of the appended claims. 
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1 . In a computer network having a plurality of user sites, a method of weighting the 
relative importance of a plurality of data items stored in a database on a server computer 
comprising the steps of: 

receiving at said server computer a keyword from a user site; 

generating at said server computer a plurality of listings corresponding to said keyword, 
each listing also corresponding to one of said data items; 

transmitting from said server computer to one of said user sites said plurality of listings; 

detecting at said server computer which ones of said plurality of data items are selected 
by said user site, said user site being transmitted each selected one of said data items upon 
selection of said corresponding listing by said user site; and 

updatmg said database to weight said selected ones of said data items as relatively more 
unportant than unselected ones of said data items with respect to said keyword. 

2, A method according to claim 1 , wherein the weigjit of said selected ones applied 
during said step of updating the database is dependent on the amount of time spent in each of 
said selected ones of said data items. 

3 . A method according to claim 1 , wherein the weight of said selected ones applied 
during said step of the updating the database is dependent on a ranking number associated with 
each selected one of said data items. 

4. A method according to claim 1 , wherein the weight of said selected ones applied 
during said step of updating the database is dependent on the amount of time spent in each of 
said selected ones of said data items and on a ranking number associated with each selected one 
of said data items. 
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S. A method according to claim 1, further including the steps of 

receiving, prior to said step of receiving said keyword, a password identifying a user; 

determining, using said password, if said user is one of a selected group; and 

wherein said step of detecting only occurs if said user is determined to be one of said 
selected group. 



6. A method according to claim 1 v4ierein, associated with each of said data items, 
is n update date; and 

wherein said step of generating generates said plurality of listings based upon data items 
corresponding to said keyword that have been most recently updated, ^ 

7. A method according to claun 1 wherein, associated with each data item, is a 
recent weighting factor X and an old weighting factor Y; and 

wherein said step of generating generates said plurality of listings based upon data items 
that are increasing in popularity the fastest, as determined using said recent weighting factor X 
and said old weighting factor Y. 

8. A method according to claim 1 , further including the steps of 

receiving, prior to sdd step of receiving said keyword, a password identifying a user; and 

wherein said step of generating generates, for said keyword received, as said plurality of 
listings only user specific listings associated with said user, said user specific listings having 
been detected in earlier ones of said detecting steps associated with said user. 

9. A method according to claim 1, wherein, associated with each data item, is a 
plurality of groups; and 
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wherein said step of receiving said keyword also receives an identification of a first of 
said groups; and 

wherein said step of generating generates said plurality of listings firom only those data 
items associated with said first identified group. 



10. A method according to claim 9, \^erein said step of receiving receives an 
identification of a second of said groups; and 

wherein said step of generating generates said plurality of listings from only those data 
items associated with both said first and said second identified groups. 



10 



11. A method according to claim 9, wherein said step of receiving receives an 
identification of a second of said groups; and 

wherein said step of generating generates said plurality of listings from only those data 
1 5 items associated with either said first or said second identified groups. 

12. A method according to claim 10, wherein said step of receiving receives an 
identification of a third of said groups; and 

wherein said step of generating generates said plurality of listings fi-om only those data 
items associated with all of said first, said second and said third identified groups. 

20 

13. A method according to claim 1 1 wherein said step of receiving receives an 
identification of a third of said groups; and 

wherein said step of generatmg generates said plurality of listings from only those data 
items associated with either said first, said second or said third identified groups. 



14. A method according to claim 1, further including the step of determining if said 
keyword is permitted keyword; and 
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wherein said step of generating is only perfoimed if said keyword is a permitted 
keyword. 

15. A method according to claim 1 herein said step of detecting only detects each 
S keyword one time from each user site during a determined interval of time. 

1 6. A method according to claim 1 5, wherein said step of detecting each keyword 
one time includes the steps of: 

associating an identifier with each user site; and 

10 using said identifier to track keywords that have been entered from each of said plurality 

of user sites. 

17. A method according to claim 15, wherein said step of generating uses a history 
factor associated with each keyword in determining said plurality of listings. 

IS 

1 8. A method according to claim 1 6, wherein said history factor is a number less than 
or equal to 1 and greater than or equal to 0. 

19. A method according to claim 1 wherein, associated with each of said data items, 
20 is an update date; and 

wherein said step of generating generates said plurality of listings based upon data items 
corresponding to said keyword that have been most recently updated and data items that have 
been most popular. 



25 20. A method according to claim 1 wherein, associated with each data item, is a 

recent weighting factor X and an old weighting factor Y; and 
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wherein said step of generating generates said plurality of listings based upon data items 
that arc increasing in popularity the fastest, as determined using said recent weighting factor X 
and said old weighting factor Y, and data items that have been most popular. 

5 21. A method according to claim 1 , further including the steps of 

receiving, prior to said step of receiving said keyword, a password identifying a user; and 

wherein said step of generating generates, for said keyword received, as said plurality of 
listings a combination of user specific listings associated wi A said user, said user specific 
listings having been detected in earlier ones of said detecting steps associated with said user, and 
10 data items that have been most popular. 



22. A method according to claim 1 wherein, associated with each data item, is a 
recent weighting factor X and an old weighting factor Y; and . 

1 S herein said step of generating generates said plurality of listings based upon data items 

that are increasing in popularity the fastest, as determined using said recent weighting factor X 
and said old weighting factor Y, and data items corresponding to said keyword that have been 
most recently updated 

20 23. A method according to claim 1 , further including the steps of 

receiving, prior to said step of receiving said keyword, a passwoid identifying a user, and 

wherein said step of generating generates, for said keyword received, as said plurality of 
listings a combination of user specific listings associated with said user, said user specific 
listings having been detected in earlier ones of said detecting steps associated with said user, and 
25 upon data items corresponding to said keyword that have been most recently updated. 



24. A method according to claim 1, further including the steps of 
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receiving, prior to said step of receiving said keyword, a password identifying a user, and 

wherein said step of generating generates, for said keyword received, as said plurality of 
listings a combination of user specific listings associated with said user, said user specific 
listings having been detected in earlier ones of said detecting steps associated with said user, and 
5 based upon data items that are increasing in popularity the fastest, as determined using said 
recent weighting factor X and said old weighting factor Y. 

25. A method according to claim 1 wherein, associated with each of said data items, 
is an update date; and 

10 wherein said step of generating generates said plurality of listings based only upon data 

items corresponding to said keyword that have been most recently updated. 

26. A method accordmg to claim 1 wherein, associated with each data item, is a 
recent weighting factor X and an old weighting factor Y; and 

15 wherein said step of generating generates said plurality of listings based only upon data 

items that are increasing in popularity the fastest, as determined using said recent weighting 
factor X and said old weighting &ctor Y. 

27. A method according to claim 1 wherein, associated with each of said data items, 
20 is an update date; and 

wherein said step of generating generates said plurality of listings based only upon data 
items that have been most popular. 

28. In a computer network having a plurality of user sites and developer sites, a 
25 method of populating a database on a server computer comprising the steps of: 

entering a plurality of data items into said database from said developer sites, each of 
said data items entered into said database including as associated identifiers a plurality of 
associated keywords; and 
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updating said database by entering a plurality of user traces, each of said user traces 
identifying one of said data items and an associated keyword so that each trace increases the 
relative importance of the associated data item with respect to said associated keyword. 

29. A method according to claim 28 wherein said step of entering said plurality of 
data items includes as one of said associated identifiers one of a creation date and an update 
date. 

30. A method according to claim 28 wherein said step of entering said plurality of 
data items includes as one of said associated identifiers a developer site identifier. 

31. A method according to claim 30 wherein said developer site identifier is used to 
prevent said developer site fiom being used during said step of updating for said data items 
entered by said developer site. 

32. A method according to claim 28 wherein each of said user traces includes a user 
identifier; and said user identifier is used to update a user table. 

33. A method according to claim 1 7 wherein said user table includes user traces 
coiiesponding to a plurality of different types of searches performed by the user. 

34. A method according to claim 33 wherein one of the types of searches performed 
by the user corresponds to keyword searches that have been most recently updated. 

35. A method according to claim 17 wherein a particular user has a plurality of xiser 
identifiers, each user identifier corresponding to a different user profile. 
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36. A method according to claim 28 herein said step of updating is perfonned by 
users having a profile that contains a plurality of characteristics, and said iq)dating updates said 
database with respect to each of the plurality of characteristics 

S 37. A method according to claim 28, wherein, associated with each data item, is a 

plurality of groups and each user is associated with some of said plurality of groups; and 

wherein said step of updating, for one user within certain of said plurality of groups, will 
cause updating of the database with respect to said certain of said plurality of groups. 



10 38. A method according to claim 28, wherein, associated with each data item, is a 

plurality of groups and each user is associated with some of said plurality of groups; and, 

wherein said step of updatmg, for one user within certain of said plurality of groups, will 
cause updating of the database with respect to only said certain of said plurality of groups. 



15 



39. A method according to claim 28, herein said user trace will be generated in 
dependence on an amoxmt of time spent by said user in a selected data item that has been 
transmitted to the user site. 



20 40. A method according to claim 39 wherein said user trace will be generated if said 

user spends a predeteimined period of time in said selected data item. 



25 



41 . A method according to claim 28, wherein said user trace will be generated in 
dependence on a ranking number associated with a selected data item that has been transmitted 
to the user. 
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42. A method according to claim 28, wherein said user trace will be generated in 
dependence on an amount of time spent by the user in a selected data item that has been 
transmitted to the user site and on a ranking number associated with the selected data item. 

43 . A method according to claim 28, furthjer including the steps of: 
prior to said step of updating, receiving a password identifying a user; 
determining, using said password, if said user is one of a selected group; and 
wherein said step of updating will not occur if said user is not one of said selected 

group. 

44. A method according to claun 43 wherein said user trace will be generated if said 
user is one of said selected group and spends a predetermined period of time in said selected 
data item. 

45. A method according to claim 28 wherein said step of updating only updates each 
keyword one time for each user during a determined interval of time. 

46. A method according to claim 45, wherein said step of updating each keyword one 
time includes the steps of: 

associating an identifier with each user; and 

udng said identifier to track keywords that have been entered by each user. 
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47. In a computer network having a plurality developer sites, a method of 
detennining content to provide along with other information transmitted from a server computer 
to user sites comprising the steps of: 

obtaining a content listing from each of said plurality of said developer sites, each of said 
5 content listings including content, a developer identifier, a keyword, and a keyword selection 
factor; 

determining a particular keyword from said obtained keywords that is the same for 
different content listings; and 

using the keyword selection factor in determining to transmit certain content associated 
10 with one of said content listings to one of said user sites rather than other content associated with 
another of said content listings. 

48. A method according to claim 47, further including obtaining a profile and a 
profile selection factor for each content listing; 

1 S determining a particular profile fiom said obtained profiles that is the same for different 

content listings; and 

using the profile selection factor in determining to transmit said certain content 
associated with one of said content listings to one of said user sites rather than said other content 
associated with another of sdd content listings. 

20 

49. A method of determining a percentage amount to electronically display content of 
a plurality of content providers to a plurality pf users comprising the steps of: 

receiving a bid amount and an associated particular keyword from each of said plurality 
of content providers, said bid amount corresponding to a value that each said content provider 
25 associates with the particular keyword for displaying a piece of content; 

correlating the bid amounts for the same keyword to determine the percentage amount 
that each piece of content will be electronically displayed to said plurality of users; and 

displaying each said piece of content based in part upon the percentage amount. 



30 



50. A method according to claim 49 wherein said percentage amount is based on a 
period of time during which said particular keyword may be received. 
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51. A method according to claim 49 wherein said percentage amount is based on a 
number of transmissions of said pardcular keyword. 

52. A method according to claim 49 further including the steps of: 
receiving additional bid amounts for said particular keyword; 

recorrelating the bid amounts and the additional bid amounts to determine an updated 
percentage amount that each content providers content will be electronically displayed to said 
plurality of users; and 

displaying each piece of content based in part upon the updated percentage amount. 

53. A method according to claim 52 wherem said updated percentage amount is 
based on a period of time during which said pardcular keyword may be received 

54. A method according to claini 52 wherein said updated percentage amount is 
based on a number of transmissions of said pardcular keyword. 

55. A method according to claim 49 fiirther including the steps of: 

receiving a revised bid amounts for said particular keyword that replaces one of said bid 
amounts; 

recorrelating the bid amounts and the revised bid amounts to determine an updated 
percentage amount that each content providers content will be electronically displayed to said 
plurality of users; and 

displaying each piece of content based in part upon the updated percentage amount. 

56. A method according to claim 55 wherein said updated percentage amount is 
based on a period of time during which said particular keyword may be received 
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57. A method according to claim 55 wherein said updated percentage amount is 
based on a number of transmissions of said particular keyword. 

58. A method of determining a percentage amount to electronically display content of 
a plurality of content providers to a plurality of users comprising the steps of: 

receiving a bid amount and an associated profile fhjm each of said plurality of content 
providers, s£dd bid amount corresponding to a value that each said content provider associates 
with the particular keyword for displaying a piece of content; 

correlating all of the bid amounts for the same profile to determine the percentage 
amount that each piece of content will be electronically displayed to said plurality of users; and 

displaying each said piece of content based upon the percentage amount. 

59. A method according to claim 58 wherein said percentage amount is based on a 
period of time during which said profile may be received. 

60. A method according to claim 58 wherein said percentage amount is based on a 
number of transmissions of said profile. 

61. A method according to claim 58 fiirther mcluding the steps of: 
receiving additional bid amoimts for said profile; 

recorrelating the bid amounts and the additional bid amounts to determine an updated 
percentage amount that each content providers content will be electronically displayed to said 
plurality of users; and 

displaying each piece of content based upon the updated percentage amount 

62. A method according to claim 61 wherein said updated percentage amount is 
based on a period of time during which said profile may be received 

63. A method according to claim 61 wherein said updated percentage amount is 
based on a number of transmissions of said profile. 
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64. A method of detenniiiing which one content to electronically display to one user 
5 &om among a plurality of different contents provided by content providers comprising the steps 
of: 

receiving a bid amount from each of said plurality of content providers, said bid amount 
corresponding to a value for displaying a piece of content that each said content provider 
associates with at least a keyword or a profile of a group of users; 
10 correlating all of the bid amounts to the different keywords and profiles for each different 

bid amount so that it is possible to determine which one content has the highest percentage 
amount for each of the different keywords and profiles; 

receiving one keyword firora one user who falls within said profile of the group of users; 
transmittmg to the one user the one piece of content that has the highest corresponding 
1 5 bid amount based upon that one keyword and the profile of the user. 



65. In a computer network having a plurality of user sites, a method of weighting the 
relative importance of a plurality of keywords stored in a database on a server computer 
20 comprising the steps of: 

receiving at said server computer an initial keyword firom a user site; 

generating at said server computer a plurality of related keywords corresponding to said 
initial keyword; 

transmitting from said server computer to one of said user sites said plurality of related 
25 keywords; 

detecting at said server computer which one of said plurality of related keywords are 
selected by said user site; and 

updating said database to weight a relationship of said selected keyword and said initial 
keyword greater than a relationship of said unselected keywords and said initial keyword. 

30 
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66. A method according to claim 65 wherein there exist a plurality of profiles that 
group said users of said plurality of sites, and wherein the updating step applies a different 
weight to different ones of said plurality of profiles for said selected keyword. 

67. A method according to claim 65, wherein said step of generating also takes into 
account a profile associated with the user site submitting the initial keyword when generating 
the plurality of related keywords. 

68. A method according to claim 65 v^erein the plurality of related keywords are 
stored as a hierarchical directory, such that keywords that are used more often are higher in die 
hierarchy. 

69. A method according to claim 65 wherein the plurality of related keywords is 
generated in part fiom misspellings of said initial keyword. 

70. A method according to claim 65 wherein the plurality of related keywords is 
generated in part from phrases containing said initial keyword. 

71 . A method according to claim 65 wherein the plurality of related keywords is 
generated in part using past preference information from a particular user. 

72. A method according to claim 65 wherein the plurality of related keywords is 
generated in part from listings of keywords that have been most recently updated and keywords 
that have been most popular. 

73. A method according to claim 65, wherein, associated with each keyword is a 
plurality of groups; and 
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wherein said step of receiving said keyword also receives an identification of a first of 
said groups; and 

wherein said step of generating generates said plurality of related keywords from only 
those keywords associated with said first identified group. 

74. A method according to claim 73, wherein said step of receiving receives an 
identification of a second of said groups; and 

wherein said step of generating generates said plurality of related keywords from only 
those keywords associated with both said first and said second identified groups. 
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List of web pages is produced based on a 

random selection of any URL with 
values for X,Y or Z from table 3 for the 
keyword entered 
354) 



The list of web pages is combined 
with the web page details in table 2 
(URL address, description) 
056) 



t 



Resulting set of web 
pages are sent to user 
(358) 



Figure 9 
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Previous favorites search select 
and keyword entrv (372) 



List of web pages is produced 
basedpn a values of X from table 6 
for the keyword entered 



The list of web pages is combined 
with the web page details in table 2 
(URL address, description) 



(374) 




(376) 



Resulting set of web 
pages are sent to user 
(378) 



Figure 10 
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Collective search select and 
kevword entrv f402) 



List of web pages produced that is a mixture 
of conventional, popular, highflier, new, and 
past preferences searches 
(404) 



The list of web pages is combined with the web 
page details In table 2 (URL address, 




description) 
(406) 



Resulting set of web pages are 
sent to user 



(408) 



Figure 11 
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Date search select and keyword 
/date-time entry (432) 



List of web pages produced based 
on tiie nearness of the date to value 
0fZizitabie3 
(434) 



t 



The list of web pages is combined 


with the web page details in table 2 




(URL address, description) 






[436) 








r 






Resulting set of web 






pages are sent to user 






(438) 





Figure 12 
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Custom search select and 
keyword /User ID entrv (462) 



Preferences for how the default 
search results will 5e combine 
(Table 5) 
(464) 



List of web pages produced that is a 
mixture of the popular, highflier, new, 
and past preferences searches as defined 
by a customers predefined preferences 
(466) 





r 


The list of web 
web page deta 


pages is combined with the 
is in table 2 (URL address, 
description) 
(468) 



Resulting set of web pages 
are sent to user 

(470) 



Figure 13 
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keyword and User ID entry 
f490) 



Users default profile (Table 5) 
Eg Male (XI), Student (X5), 
USPC7) 
(492) 




List of web pages produced based 
on die combined values of X (from 
table 3) that match the profile type 
oftheuser(egXl,X5.X7) 
(494) 


► 



The list of web pages is combined 
with the web page details in table 2 
(URL address, description) 
(496) 



Resulting set of web 
pages are sent to user 

(498) 



Figure 14 
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keyword entry 












r 


List of inaccessible 


keyword searches 


(522) 




r 




Keyword entered 






into search engine 






(524) 





Figure 15 
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Keyword, IP address, and User 
ID entry f554'> 




If user has a User ID then tagged 
results are always send out so the 
personal preferences table can be 
updated 

(556) \ 



Keyword and IP, Users ID are 
checked against security table 
7. If it is not a repeat key word 
search then tagged results are 
sent out 
(558) 



Keyword is compared to table I 
If it meets predefmed sampling 
criteria tagged results are sent 
out 

^^60) 




Figure 16 
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User ID, keywords, search types, profile types 
entry (588) ^^^^^^^^ 



Keyword table 
fl64) 



Security table 
ri68) 



Cumulative Surfer 
Trace 
(170) 



Profile Ids 
166) 



Keyword 
URL 
Links 
(172) 



Personal 
Links 
(174) 



URL Table 
n88) 



List of suggested web pages 
(604) 



Figure 17 
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User ID 
entry (6^0) 



Keyword table 
(164) 



Security table 
(168) 



Cumulative Surfer 
Trace 
(170) 



Profile Ids 
(166) ^ 







Personal 






Links 






(174) 


Keyword 






URL 


<— 




Links 






(172) 







URL Table 
(188) 



List of suggested web pages 
f636) 



Figure 18 
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Keyword, profile type 
entry (640) 



Key word surfer trace 
(648) 

-keyword (original) (52) 
-keyword (selected) (652|) 
-IP address (130) 
-User ID (128) 
-Date-tune(132) 




Normal searches 
for keywords 
(646) 



Tagged suggested keywords (660) 



User selects keyword (662) 



User taken to results of selected keyword (664) 



Figure 19 
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Keyword , TP address and, user 
ID entry 684 

/ 



2 



Security table 

ri68i 



Keyword links table 
(696) 



Cumulative keyword trace 
^698) 



z 



Data from keyword suggester trace (648) 
•keyword (original) (52) 
-k^ord (selected) (652) 
• IP address (130) 
-User ED (128) 
-Date time (132) 



Figure 20 
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Click through data (772) 
-keyword (774) 
-profile type(776) 
-date time (778) 
- IP address (780) 
-User ID (782) 
-URL. no. (784) 



Keyword, profile entry 
(762) 



Content selector (764) 



Tagged content sent to user (766) 
^ User selecte content (768) 

User taken to content URL (770) 



Figure 22 
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Keywords and profile entry (800) 



Keyword bids 
{S,maxS) 
(802) 



Keyword 
content data 
sets(804) 




Profile content 
data sets(806} 

^ 



Profile bids 
($,max S) 
(808) 



Data from 
content sent out 
(keyword profile 
type) 
(810) 



Click through data 

-*keyword(820) 
->date-time(822) 
-^IP address(824) 
->userID(826) 
^profile type5(828) 



Details of Content 
Provider-(812) 
(Address- (8 14) 
description of content,(8 1 6) 
payment details)(8 18) 



Figure 23 
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Keyword profile 
(852) 



Keyword content 
data sets (854) 



Value of keyword 
Content 
I (858) 

Determining the type, 
of Content (862)^ 



1 



Determining the specific 

Content (864) 



Content sent to^ 
user 
(868) 



Profile content data 
sets 
(856) 

Value^f profile 
Content 
(860) 



Detaite of Content 
Provider 
Data sets 



Details of Content 
numbers sent to 
Content provider 
(870) 
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Web page listing 900 



Web page listing 900 



Web page listing 900 



Web page listing 900 



WO 911^8028 

) 
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